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GENES IDENTIFIED AS REQUIRED FOR PROLIFERATION OF E. COLI 



BACKGROUND OF THE INVENTION 



Since the discovery of penicillin, the use of antibiotics to treat the ravages of bacterial 



5 infections has saved millions of lives. With the advent of these "miracle drugs" for a time it was 
popularly believed that humanity might, once and for all, be saved from the scourge of bacterial 
infections. In fact, during the 1980s and early 1990s, many large pharmaceutical companies cut 
back or eliminated antibiotics research and development They believed that infectious disease 
caused by bacteria finally had been conquered and that markets for new drugs were limited. 

1 0 Unfortunately, this belief was overly optimistic. 

The tide is be ginning to turn in favor of the bacteria as reports of drug resistant bacteria 
become more frequent The United States Centers for Disease Control announced that one of the most 
powerful known antibiotics, vancomycin, was unable to treat an infection of the common 
Staphylococcus aureus (staph). This organism is commonly found in our environment and is 

15 responsible for many nosocomial infections. The import of this announcement becomes clear when 
one considers that vancomycin was used for years to treat infections caused by stubborn strains of 
bacteria, like staph. In short, the bacteria are becoming resistant to our most powerful antibiotics. 
If this trend continues, it is conceivable that we will return to a time when what are presently 
considered minor bacterial infections are fatal diseases. 

20 There are a number of causes for the predicament in which practitioners of medical arts find 

themselves. Over-prescription and improper prescription habits by some physicians have caused an 
indiscriminate increase in the availability of antibiotics to the public. The patient is also partly 
responsible, for even in instances where an antibiotic is the appropriate treatment, patients will often 
improperly use the drug, the result being yet another population of bacteria that is resistant, in whole or 

25 in part, to traditional antibiotics. 



modern scientific practices to deal with the diseases that they cause. Drug resistant bacteria are now 
advancing on the health of humanity . A new generation of antibiotics to once again deal with the 



As more and more bacterial strains become resistant to the panel of available antibiotics, 
new compounds are required. In the past, practitioners of pharmacology would have to rely upon 
traditional methods of drug discovery to generate novel, safe and efficacious compounds for the 
treatment of disease. Traditional drug discovery methods involve blindly testing potential drug 
35 candidate-molecules, often selected at random, in the hope that one might prove to be an effective 
treatment for some disease. The process is painstaking and laborious, with no guarantee of success. 
Today, the average cost to discover and develop a new drug is nearly US $500 million, and the 



The bacterial scourges that have haunted humanity remain, in spite of the development of 




30 



Discovery of New Antibiotics 
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average time is 15 years from laboratory to patient. Improving this process, even incrementally, 
would represent a huge advance in the generation of novel antimicrobial agents. 

Newly emerging practices in drug discovery utilize a number of biochemical techniques to 
provide for directed approaches to creating new drugs, rather than discovering them at random. For 
5 example, gene sequences and proteins encoded thereby that are required for the proliferation of an 
organism make for excellent targets since exposure of bacteria to compounds active against these 
targets would result in the inactivation of the organism. Once a target is identified, biochemical 
analysis of that target can be used to discover or to design molecules that interact with and alter the 
functions of the target. Using physical and computational techniques, to analyze structural and 

10 biochemical targets in order to derive compounds that interact with a target is called rational drug 
design and offers great future potential. Thus, emerging drug discovery practices use molecular 
modeling techniques, combinatorial chemistry approaches, and other means to produce and screen 
and/or design large numbers of candidate compounds. 

Nevertheless, while this approach to drug discovery is clearly the way of the future, 

15 problems remain. For example, the initial step of identifying molecular targets for investigation can 
be an extremely time consuming task. It may also be difficult to design molecules that interact with 
the target by using computer modeling techniques. Furthermore, in cases where the function of the 
target is not known or is poorly understood, it may be difficult to design assays to detect molecules 
that interact with and alter the functions of the target To improve the rate of novel drug discovery 

20 and development, methods of identifying important molecular targets in pathogenic 
microorganisms and methods for identifying molecules that interact with and alter the functions of 
such molecular targets are urgently required. 

Escherichia coli represents an excellent model system to understand bacterial biochemistry 
and physiology. The estimated 4288 genes scattered along the 4.6 x 10 6 base pairs of the 

25 Escherichia coli (E coli) chromosome offer tremendous promise for the understanding of bacterial 
biochemical processes. In turn, this knowledge will assist in the development of new tools for the 
diagnosis and treatment of bacteria-caused human disease. The entire E. coli genome has been 
sequenced, and this body of information holds a tremendous potential for application to the 
discovery and development of new antibiotic compounds. Yet, in spite of this accomplishment, the 

30 general functions or roles of many of these genes are still unknown. For example, the total number 
of proliferation-required genes contained within die E. coli genome is unknown, but has been 
variously estimated at around 200 to 700 (Armstrong, K.A. and Fan, DJ\ Essential Genes in the 
metB-malB Region of Escherichia coli K12, 1975, J. Bacteriol. 126: 48-55). 

Novel, safe and effective antimicrobial compounds are needed in view of the rapid rise of 

35 antibiotic resistant microorganisms. However, prior to this invention, the characterization of even a 
single bacterial gene was a painstaking process, requiring years of effort Accordingly, there is an 
urgent need for more novel methods to identify and characterize bacterial genomic sequences that 
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encode gene products required for proliferation and for methods to identify molecules that interact 
with and alter the functions of such genes and gene products. 

SUMMARY OF THE INVENTION 
S One embodiment of the present invention is a purified or isolated nucleic acid sequence 

consisting essentially of one of the sequence of nucleotides of SEQ ID NOs: 1-93, wherein 
expression of said nucleic acid in a microorganism is capable of inhibiting the proliferation of a 
microorganism. The nucleic acid sequence may have as sequence of nucleotides complementary to 
at least a portion of the nucleotide sequence of the coding strand of a gene whose expression is 
10 required for proliferation of a microorganism. The nucleic acid may have a nucleotide sequence 
complementary to at least a portion of the nucleotide sequence of an RNA required for proliferation 
of a microorganism. The nucleotide sequence of the RNA may encode more than one gene product 
Another embodiment of the present invention is a purified or isolated nucleic acid 
comprising a fragment of one of the nucleotide sequences of SEQ ID NOs.: 1-93, said fragment 
IS selected from the group consisting of fragments comprising at least 10, at least 20, at least 25, at 
least 30, at least 50 and more than 50 consecutive nucleotides of the nucleotide sequence of one of 
SEQ ID NOs: 1-93. 

Another embodiment of the present invention is a vector comprising a promoter operably 
linked to the nucleic acid sequences of each of the preceding paragraphs. The promoter may be 

20 active in a microorganism selected from the group consisting of Aspergillus fumigatus, Bacillus 
anthracis, Burkholderia cepacia, Campylobacter jejuni, Candida albicans, Candida glabrata (also 
called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida guilliermondii t 
Candida krusei, Candida kejyr (also called Candida pseudotropicalis), Candida dubliniensis, 
Chlamydia pneumoniae, Chlamydia trachomatus, Clostridium botulinum, Clostridium difficile, 

25 Cryptococcus neoformans, Enterobacter cloacae, Enterococcus faecalis, Escherichia coli, 
Haemophilus influenzae, Helicobacter pylori, Klebsiella pneumoniae, Listeria monocytogenes, 
Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria gonorrhoeae, Pseudomonas 
aeruginosa, Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, 
Salmonella typhimurium, Staphylococcus aureus, Klebsiella pneumoniae, Listeria monocytogenes, 

30 Moxarella catarrhalis, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella somiei, 
Pseudomonas aeruginosa, Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema 
pallidum, Yersinia pestis and any species falling within the genera of any of the above species. 

Another embodiment of the present invention is a host cell containing the vectors of the 
preceding paragraph. 

35 Another embodiment of the present invention is a purified or isolated nucleic acid 

consisting essentially of the coding sequence of one of SEQ ID NOs: 106-112, 119-122, 134-160, 
164-171, 179-265, 271-273, 275, and 279-286. 
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Another embodiment of the present invention is a fragment of the nucleic acid of the 
preceding paragraph, said fragment comprising at least 10, at least 20, at least 25, at least 30, at 
least 50 or more than 50 consecutive nucleotides of one of SEQ ID NOs: 106-112, 1 19-122, 134- 
160, 164-171, 179-265, 271-273, 275, and 279-286. 
5 Another embodiment of the present invention is a vector comprising a promoter operably 

linked to the nucleic acid of the preceding two paragraphs. 

Another embodiment of the present invention is a purified or isolated antisense nucleic acid 
comprising a nucleic acid sequence complementary to at least a portion of an intragenic sequence, 
intergenic sequence, sequences spanning at least a portion of two or more genes, 5' noncoding 
10 region, or 3* noncoding region within an operon comprising a proliferation-required gene whose 
activity or expression is inhibited by an antisense nucleic acid comprising one of SEQ ID NOs.: 1- 
93. 

Another embodiment of the present invention is a purified or isolated nucleic acid 
comprising a nucleic acid having at least 70% identity to a sequence selected from the group 

1 5 consisting of SEQ ID NOs.: 1-93, fragments comprising at least 25 consecutive nucleotides of SEQ 
ID NOs.: 1-93, the sequences complementary to SEQ ID NOs.: 1-93 and the sequences 
complementary to fragments comprising at least 25 consecutive nucleotides of SEQ ID NOs.: 1-93 
as determined using BLASTN version 2.0 with the default parameters. The nucleic acid may be 
from an organism selected from the group consisting of Aspergillus jumigatus, Bacillus anthracis, 

20 Burkholderia cepacia, Campylobacter jejuni, Candida albicans, Candida glabrata (also called 
Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida 
krusei, Candida kefyr (also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia 
pneumoniae, Chlamydia trachomatis, Clostridium botulinum, Clostridium difficile, Cryptococcus 
neoformans, Enterobacter cloacae, Enterococcus faecalis, Escherichia coli, Haemophilus 

25 influenzae, Helicobacter pylori, Klebsiella pneumoniae, Listeria monocytogenes, Mycobacterium 
leprae, Mycobacterium tuberculosis, Neisseria gonorrhoeae, Pseudomonas aeruginosa, Salmonella 
cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Staphylococcus aureus, Klebsiella pneumoniae, Listeria monocytogenes, Moxarella 
catarrhalis, Shigella boydii, Shigella dysenteriae, Shigella jlexneri, Shigella sonnei, Pseudomonas 

30 aeruginosa, Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema pallidum, 
Yersinia pestis and any species falling within the genera of any of the above species. 

Another embodiment of the present invention is a vector comprising a promoter operably 
linked to a nucleic acid encoding a polypeptide whose expression is inhibited by an antisense 
nucleic acid comprising one of SEQ ID NOs.: 1-93. The polypeptide may comprise a polypeptide 

35 comprising a sequence selected from the group consisting of SEQ ID NOs: 299-305, 3 12-3 15, 327- 
353, 357-364, 372^58, 464-466, 468 and 472-479. 
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Another embodiment of the present invention is a host cell containing the vector of the 
preceding paragraph. 

Another embodiment of the present invention is a purified or isolated polypeptide 
comprising a polypeptide whose expression is inhibited by an antisense nucleic acid comprising one 
5 of SEQ ED NOs.: 1-93, or a fragment selected from the group consisting of fragments comprising at 
least 5, at least 10, at least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 
consecutive amino acids of one of the said polypeptides. The polypeptide may comprise a 
polypeptide comprising one of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 464- 
466, 468 and 472-479 or a fragment comprising at least 5, at least 10, at least 20, at least 30, at least 

10 40, at least 50, at least 60 or more than 60 consecutive amino acids of a polypeptide comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357- 
364, 372-458, 464^t66, 468 and 472-479. 

Another embodiment of the present invention is a purified or isolated polypeptide 
comprising a polypeptide having at least 25% identity to a polypeptide whose expression is 

15 inhibited by a sequence selected from the group consisting of SEQ ID NOs.: 1-93, or at least 25% 
identity to a fragment comprising at least 5, at least 10, at least 20, at least 30, at least 40, at least 
50, at least 60 or more than 60 consecutive amino acids of a polypeptide whose expression is 
inhibited by a nucleic acid selected from the group consisting of SEQ ID NOs.: 1-93 as determined 
using FASTA version 3. 0t78 with the default parameters. The polypeptide may have at least 25% 

20 identity to a polypeptide comprising one of SEQ ID NOs: 299-305, 312-315, 327-353, 357-364, 
372-458, 464-466, 468 and 472-479 or at least 25% identity to a fragment comprising at least 5, at 
least 10, at least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 consecutive 
amino acids of a polypeptide comprising one of SEQ ID NOs.: 299-305, 312-315, 327-353, 357- 
364, 372-458, 464-466, 468 and 472-479 as determined using FASTA version 3.0t78 with the 

25 default parameters. 

Another embodiment of the present invention is an antibody capable of specifically binding 
one of the polypeptides of the preceding paragraph. 

Another embodiment of the present invention is a method of producing a polypeptide, 
comprising introducing a vector comprising a promoter operably linked to a nucleic acid encoding a 

30 polypeptide whose expression is inhibited by an antisense nucleic acid comprising one of SEQ ID 
NOs.: 1-93 into a cell and expressing said polypeptide. The method may further comprise the step 
of isolating said polypeptide. The polypeptide may comprise a sequence selected from the group 
consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 464-466, 468 and 472- 
479. 

35 Another embodiment of the present invention is a method of inhibiting proliferation of a 

microorganism comprising inhibiting the activity or reducing the amount of a gene product whose 
expression is inhibited by an antisense nucleic acid comprising a sequence selected from the group 
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consisting of SEQ ID NOs.: 1-93 or inhibiting the activity or reducing the amount of a nucleic acid 
encoding said gene product The gene product may comprise a polypeptide comprising a sequence 
selected from the group consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 
464-466, 468 and 472-479. 
5 Another embodiment of the present invention is a method for identifying a compound 

which influences the activity of a gene product required for proliferation, said gene product 
comprising a gene product whose expression is inhibited by an antisense nucleic acid comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 1-93, said method comprising 
contacting said gene product with a candidate compound and determining whether said compound 

10 influences the activity of said gene product. The gene product may be a polypeptide and said 
activity may be an enzymatic activity. The gene product may be a polypeptide and said activity 
may be a carbon compound catabolism activity. The gene product may be a polypeptide and said 
activity may be a biosynthetic activity. The gene product may be a polypeptide and said activity 
may be a transporter activity. The gene product may be a polypeptide and said activity may be a 

15 transcriptional activity. The gene product may be a polypeptide and said activity may be a DNA 
replication activity. The gene product may be a polypeptide and said activity my be a cell division 
activity. The geije product may be a polypeptide comprising a sequence selected from the group 
consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 464-466, 468 and 472- 
479. 

20 Another embodiment of the present invention is a compound identified using the methods 

of the preceding paragraph. 

Another embodiment of the present invention is a method for identifying a compound or 

nucleic acid having the ability to reduce the activity or level of a gene product required for 

proliferation, said gene product comprising a gene product whose activity or expression is inhibited 
25 by an antisense nucleic acid comprising a sequence selected from the group consisting of SEQ ID 

NOs.: 1-93, said method comprising: 

(a) providing a target that is a gene or RNA, wherein said target comprises a nucleic 
acid encoding said gene product; 

(b) contacting said target with a candidate compound or nucleic acid; and 
30 (c) measuring an activity of said target. 

The target may be a messenger RNA molecule and said activity may be translation of said 
messenger RNA. The target may be a messenger RNA molecule and said activity may be 
transcription of a gene encoding said messenger RNA. The target may be a gene and said activity 
may be transcription of said gene. Hie target may be a nontranslated RNA and said activity may be 
35 processing or folding of said nontranslated RNA or assembly of said nontranslated RNA into a 
protein/RNA complex. The target gene or RNA may encode a polypeptide comprising a sequence 
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selected from the group consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 
464-466, 468 and 472-479. 

Another embodiment of the present invention is a compound or nucleic acid identified 
using the methods of the preceding paragraph. 
5 Another embodiment of the present invention is a method for identifying a compound 

which reduces the activity or level of a gene product required for proliferation of a microorganism, 
wherein the activity or expression of said gene product is inhibited by an antisense nucleic acid 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93, said method 
comprising the steps of: 

10 (a) expressing a sub-lethal level of an antisense nucleic acid complementary to a 

nucleic acid encoding said gene product in a cell to reduce the activity or amount of said gene 
product in said cell, thereby producing a sensitized cell; 

(b) contacting said sensitized cell with a compound; and 

(c) determining whether said compound inhibits the growth of said sensitized cell. 

15 The determining step may comprise determining whether said compound inhibits the growth of said 
sensitized cell to a greater extent than said compound inhibits the growth of a nonsensitized cell. 
The cell may be selected from the group consisting of bacterial cells, fungal cells, plant cells, and 
animal cells. The cell may be a Gram negative bacterium. The cell may be an £ coli cell. The cell 
may be from an organism selected from the group consisting of Aspergillus fumigates, Bacillus 

20 anthracis, Burkholderia cepacia, Campylobacter jejuni, Candida albicans, Candida glabrata (also 
called Tondopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida guilliermondii, 
Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida dubliniensis, 
Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium botulmum, Clostridium difficile, 
Cryptococcus neoformans, Enterobacter cloacae, Enterococcus faecalis, Escherichia coli, 

25 Haemophilus influenzae, Helicobacter pylori, Klebsiella pneumoniae, Listeria monocytogenes, 
Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria gonorrhoeae, Pseudomonas 
aeruginosa, Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, 
Salmonella typhimurium, Staphylococcus aureus, Klebsiella pneumoniae, Listeria monocytogenes, 
Moxarella catarrhcdis, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 

30 Pseudomonas aeruginosa, Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema 
pallidum, Yersinia pestis and any species falling within the genera of any of the above species. The 
antisense nucleic acid may be transcribed from an inducible promoter. The method may further 
comprise the step of contacting said cell with a concentration of inducer which induces said 
antisense nucleic acid to a sub-lethal level. Growth inhibition may be measured by monitoring 

35 optical density of a culture growth solution. The gene product may be a polypeptide. The 
polypeptide may comprise a sequence selected from the group consisting of SEQ ID NOs.: 299- 
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305, 312-3 15, 327-353, 357-364, 372-458, 464466, 468 and 472-479. The gene product may be an 
RNA. 

Another embodiment of the present invention is a compound identified using the methods 
of the preceding paragraph. 
5 Another embodiment of the present invention is a method for inhibiting cellular 

proliferation comprising introducing a compound with activity against a gene whose activity or 
expression is inhibited by an antisense nucleic acid comprising a sequence selected from die group 
consisting of SEQ ID NOs.: 1-93 or a compound with activity against the product of said gene into 
a population of cells expressing said gene. The compound may be an antisense nucleic acid 

1 0 comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93, or a proliferation- 
inhibiting portion thereof. The proliferation inhibiting portion of one of SEQ ID NOs.: 1-93 may be 
a fragment comprising at least 10, at least 20, at least 25, at least 30, at least 50 or more than 51 
consecutive nucleotides of one of SEQ ID NOs.: 1-93. The population may be a population 
selected from the group consisting of bacterial cells, fungal cells, plant cells, and animal cells. The 

15 population may be a population of Gram negative bacteria. The population may be a population of 
E. coli cells. The population may be a population selected from die group consisting of Aspergillus 
fumigatus, Bacillus anthracis, Burkholderia cepacia, Campylobacter jejuni, Candida albicans, 
Candida glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, 
Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), 

20 Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium Botulinum, 
Clostridium difficile, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus faecalis, 
Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Klebsiella pneumoniae, Listeria 
monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria gonorrhoeae, 
Pseudomonas aeruginosa, Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, 

25 Salmonella typhi, Salmonella typhimurium, Staphylococcus aureus, Klebsiella pneumoniae, Listeria 
monocytogenes, Moxarella catarrhalis, Shigella boydii, Shigella dysenteriae, Shigella flexneri, 
Shigella sonnei, Pseudomonas aeruginosa, Staphylococcus epidermidis, Streptococcus 
pneumoniae, Treponema pallidum, Yersinia pestis and any species falling within the genera of any 
of the above species. The gene may encode a polypeptide comprising a sequence selected from the 

30 group consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 464-466, 468 
and 472-479. 

Another embodiment of the present invention is a preparation comprising an effective 
concentration of an antisense nucleic acid comprising a sequence selected from the group consisting 
of SEQ ID NOs.: 1-93, or a proliferation-inhibiting portion thereof in a pharmaceutical^ acceptable 
35 carrier. The proliferation-mhibitingportion of one of SEQ ID NOs.: 1-93 may comprise at least 10, 
at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive nucleotides of one of SEQ 
ID NOs.: 1-93. 
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Another embodiment of the present invention is a method for inhibiting the activity or 
expression of a gene in an operon required for proliferation wherein the activity or expression of at 
least one gene in said operon is inhibited by an antisense nucleic acid comprising a sequence 
selected from the group consisting of SEQ ID NOs.: 1-93, said method comprising contacting a cell 
5 in a cell population with an antisense nucleic acid comprising at least a proliferation-inhibiting 
portion of said operon. The antisense nucleic acid comprises a sequence selected from the group 
consisting of SEQ ID NOs.: 1-93 or a proliferation inhibiting portion thereof. 
The method of Claim 68, wherein said cell is contacted with said antisense nucleic acid by 
introducing a plasmid which expresses said antisense nucleic acid into said cell population. The 

10 cell may be contacted with said antisense nucleic acid by introducing a phage which expresses said 
antisense nucleic acid into said cell population. The cell may be contacted with said antisense 
nucleic acid by expressing said antisense nucleic acid from the chromosome of cells in said cell 
population. The cell may be contacted with said antisense nucleic acid by introducing a promoter 
adjacent to a chromosomal copy of said antisense nucleic acid such that said promoter directs the 

1 5 synthesis of said antisense nucleic acid. The cell may be contacted with said antisense nucleic acid 
by introducing a retron which expresses said antisense nucleic acid into said cell population. The 
cell may be contacted with said antisense nucleic acid by introducing a ribozyme into said cell- 
population, wherein a binding portion of said ribozyme is complementary to said antisense 
oligonucleotide. Hie cell may be contacted with said antisense nucleic acid by introducing a 

20 liposome comprising said antisense oligonucleotide into said cell. Hie cell may be contacted with 
said antisense nucleic acid by electroporation of said antisense nucleic acid. The antisense nucleic 
acid may be a fragment comprising at least 10, at least 20, at least 25, at least 30, at least 50 or more 
than 50 consecutive nucleotides of one of SEQ ID NOs.: 1-93. The antisense nucleic acid may be 
an oligonucleotide. 

25 Another embodiment of the present invention is a method for identifying a gene which is 

required for proliferation of a microorganism comprising: 

(a) contacting a microorganism other than E. coli with a nucleic acid selected from the 
group consisting of SEQ ID NOs.: 1-93; 

(b) determining whether said nucleic acid inhibits proliferation of said microorganism; 

30 and 

(c) identifying the gene in said microorganism which is inhibited by said nucleic acid. 
The microorganism may be a Gram negative bacterium. The microorganism may be selected from 
the group consisting of Aspergillus fumigatus, Bacillus anthracis, Burkholderia cepacia, 
Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), 

35 Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kejyr 
(also called Candida pseudotropiccdis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia 
trachomatus, Clostridium botulinum, Clostridium difficile, Cryptococcus neoformans, 
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Enterobacter cloacae, Enterococcus faecalis, Escherichia coli, Haemophilus influenzae, 
Helicobacter pylori, Klebsiella pneumoniae, Listeria monocytogenes, Mycobacterium leprae, 
Mycobacterium tuberculosis, Neisseria gonorrhoeae, Pseudomonas aeruginosa, Salmonella 
cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
5 typhimurium, Staphylococcus aureus, Klebsiella pneumoniae, Listeria monocytogenes, Moxarella 
catarrhalis, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, Pseudomonas 
aeruginosa, Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema pallidum, 
Yersinia pestis and any species falling within the genera of any of the above species. The method 
may further comprise introducing said nucleic acid into a vector functional in said microorganism 
1 0 prior to introducing said inhibitory nucleic acid into said microorganism. 

Another embodiment of the present invention is a method for identifying a compound 
having the ability to inhibit proliferation of a microorganism comprising: 

(a) identifying in a first microorganism a homolog of a gene or gene product present in 
a second microorganism which is different than said first microorganism, wherein the activity or 

15 level of said gene or gene product is inhibited by a nucleic acid comprising a sequence selected 
from the group consisting of SEQ ID NOs. 1-93 ; 

(b) identifying an inhibitoiy nucleic acid sequence which inhibits the activity of said 
homolog in said first microorganism; 

(c) contacting said first microorganism with a sub-lethal level of said inhibitory nucleic 
20 acid, thus sensitizing said first microorganism; 

(d) contacting the sensitized microorganism of step (c) with a compound; and 

(e) determining whether said compound inhibits proliferation of said sensitized 
microorganism. 

The determining step may comprise determining whether said compound inhibits proliferation of 
25 said sensitized microorganism to a greater extent than said compound inhibits proliferation of a 
nonsensitized microorganism. Step (a) may comprise identifying a homologous nucleic acid to a 
gene or gene product whose activity or level is inhibited by a nucleic acid selected from the group 
consisting of SEQ ID NOs. 1-93 or a nucleic acid encoding a homologous polypeptide to a 
polypeptide whose activity or level is inhibited by a nucleic acid selected from the group consisting 
30 of SEQ ID NOs. 1-93 by using an algorithm selected from the group consisting of BLASTN 
version 2.0 with the default parameters and FASTA version 3.0t78 algorithm with the default 
parameters to identify said homologous nucleic acid or said nucleic acid encoding a homologous 
polypeptide in a database. Step (a) may comprise identifying a homologous nucleic acid or a 
nucleic acid encoding a homologous polypeptide by identifying nucleic acids which hybridize to 
35 said first gene. Step (a) may comprise expressing a nucleic acid selected from the group consisting 
of SEQ ID NOs. 1-93 in said microorganism. The inhibitory nucleic acid may be an antisense 
nucleic acid. The inhibitory nucleic acid may comprise an antisense nucleic acid to a portion of 
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said homology The inhibitory nucleic acid may comprise an antisense nucleic acid to a portion of 
the operon encoding said homolog. The step of contacting the first microorganism with a sub-lethal 
level of said inhibitory nucleic acid may comprise directly contacting said microorganism with said 
inhibitory nucleic acid. The step of contacting the first microorganism with a sub-lethal level of 
S said inhibitory nucleic acid may comprise expressing an antisense nucleic acid to said homolog in 
said microorganism. The gene product may comprise a polypeptide comprising a sequence selected 
from the group consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 464- 
466, 468 and 472-479. 

Another embodiment of the present invention is a compound identified using the method of 
10 the preceding paragraph. 

Another embodiment of the present invention is a method of identifying a compound 
having the ability to inhibit proliferation comprising: 

(a) contacting a microorganism other than E. coli with a sub-lethal level of a nucleic 
acid comprising a sequence selected from the group consisting of SEQ ID NOs. 1-93 or a portion 

1 5 thereof which inhibits the proliferation of E. coli, thus sensitizing said microorganism; 

(b) contacting the sensitized microorganism of step (a) with a compound; and 

(c) determining whether said compound inhibits proliferation of said sensitized 
microorganism. 

The determining step may comprise determining whether said compound inhibits proliferation of 
20 said sensitized microorganism to a greater extent than said compound inhibits proliferation of a 
nonsensitized microorganism. 

Another embodiment of the present invention is a compound identified using the methods 
of the preceding paragraph. 

Another embodiment of the present invention is a method for identifying a compound 
25 having activity against a biological pathway required for proliferation comprising: 

(a) sensitizing a cell by expressing a sub-lethal level of an antisense nucleic acid 
complementary to a nucleic acid encoding a gene product required for proliferation, wherein the 
activity or expression of said gene product is inhibited by an antisense nucleic acid comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 1-93, in said cell to reduce the 

30 activity or amount of said gene product; 

(b) contacting the sensitized cell with a compound; and 

(c) determining whether said compound inhibits the growth of said sensitized cell. 
The determining step may comprise determining whether said compound inhibits the growth of said 
sensitized cell to a greater extent than said compound inhibits the growth of a nonsensitized cell. 

35 The cell may be selected from the group consisting of bacterial cells, fungal cells, plant cells, and 
animal cells. The cell may be a Gram negative bacterium. The Gram negative bacterium may be E. 
coli. The cell may be selected from the group consisting of Aspergillus fumigalus, Bacillus 
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anthracis, Burkholderia cepacia, Campylobacter jejuni, Candida albicans, Candida glabrata (also 
called Tondopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida guilliermondii, 
Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida dubliniensis, 
Chlamydia pneumoniae, Chlamydia trachomatus, Clostridium botodinum, Clostridium difficile, 
5 Cryptococcus neoformans, Enterobacter cloacae, Enterococcus faecalis, Escherichia coli, 
Haemophilus influenzae, Helicobacter pylori, Klebsiella pneumoniae, Listeria monocytogenes, 
Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria gonorrhoeae, Pseudomonas 
aeruginosa, Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, 
Salmonella typhimurium, Staphylococcus aureus, Klebsiella pneumoniae, Listeria monocytogenes, 

10 Maxarella catarrhalis, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 
Pseudomonas aeruginosa, Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema 
pallidum, Yersinia pestis and any species falling within the genera of any of the above species. The 
antisense nucleic acid may be transcribed from an inducible promoter. The method may further 
comprise contacting the cell with an agent which induces expression of said antisense nucleic acid 

15 from said inducible promoter, wherein said antisense nucleic acid is expressed at a sub-lethal level. 
The inhibition of proliferation may be measured by monitoring the optical density of a liquid 
culture. The gene product may comprise a polypeptide comprising a sequence selected from the 
group consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 464-466, 468 
and 472-479. 

20 Another embodiment of the present invention is a compound identified using the methods 

of the preceding paragraph. 

Another embodiment of the present invention is a method for identifying a compound 
having the ability to inhibit cellular proliferation comprising: 

(a) contacting a cell with an agent which reduces the activity or level of a gene product 
25 required for proliferation of said cell, wherein said gene product is a gene product whose activity or 

expression is inhibited by an antisense nucleic acid comprising a sequence selected from the group 
consisting of SEQ m NOs.: 1-93; 

(b) contacting said cell with a compound; and 

(c) determining whether said compound reduces proliferation of said contacted cell. 

30 The determining step may comprise determining whether said compound reduces proliferation of 
said contacted cell to a greater extent than said compound reduces proliferation of cells which have 
not been contacted with said agent The agent which reduces the activity or level of a gene product 
required for proliferation of said cell may comprise an antisense nucleic acid to a gene or operon 
required for proliferation. The agent which reduces the activity or level of a gene product required 

35 for proliferation of said cell may comprise a compound known to inhibit growth or proliferation of 
a microorganism. The cell may contain a mutation which reduces the activity or level of said gene 
product required for proliferation of said cell. The mutation may be a temperature sensitive 
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mutation. The gene product may comprise a polypeptide comprising a sequence selected from the 
group consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 464-466, 468 
and 472^79. 

Another embodiment of the present invention is a compound identified using the method of 
5 the preceding paragraph. 

Another embodiment of the present invention is a method for identifying the biological 
. pathway in which a proliferation-required gene or its gene product lies, wherein said gene or gene 
product comprises a gene or gene product whose activity or expression is inhibited by an antisense 
nucleic acid comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93, said 
10 method comprising: 

(a) expressing a sub-lethal level of an antisense nucleic acid which inhibits the activity 
of said proliferation-required gene or gene product in a cell; 

(b) contacting said cell with a compound known to inhibit growth or proliferation of a 
microorganism, wherein the biological pathway on which said compound acts is known; and 

15 (c) determining whether said cell is sensitive to said compound. 

The determining step may comprise determining whether said cell has a substantially greater 
sensitivity to said compound than a cell which does not express said sub-lethal level of said 
antisense nucleic acid and wherein said gene or gene product lies in the same pathway on which 
said compound acts if said cell expressing said sub-lethal level of said antisense nucleic acid has a 

20 substantially greater sensitivity to said compound than said cell which does not express said sub- 
lethal level of said antisense nucleic acid. 

The gene product may comprise a polypeptide comprising a sequence selected from the group 
consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 464-466, 468 and 472- . 
479. 

25 Another embodiment of the present invention is a method for determining the biological 

pathway on which a test compound acts comprising: 

(a) expressing a sub-lethal level of an antisense nucleic acid complementary to a 
proliferation-required nucleic acid in a cell, wherein the activity or expression of said proliferation- 
required nucleic acid is inhibited by an antisense nucleic acid comprising a sequence selected from 

30 the group consisting of SEQ ID NOs.: 1-93 and wherein the biological pathway in which said 
proliferation-required nucleic acid or a protein encoded by said proliferation-required polypeptide 
lies is known, 

(b) contacting said cell with said test compound; and 

(c) determining whether said cell is sensitive to said test compound. 

35 The determining step may comprise determining whether said cell has a substantially greater 
sensitivity to said test compound than a cell which does not express said sub-lethal level of said 
antisense nucleic acid. The method may further comprise: 
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(d) expressing a sub-lethal level of a second antisense nucleic acid complementary to a 
second proliferation-required nucleic acid in a second cell, wherein said second proliferation- 
required nucleic acid is in a different biological pathway than said proliferation-required nucleic 
acid in step (a); and 

5 (e) determining whether said second cell does not have a substantially greater 

sensitivity to said test compound than a cell which does not express said sub-lethal level of said 

second antisense nucleic acid, wherein said test compound is specific for die biological pathway 

against which the antisense nucleic acid of step (a) acts if said second cell does not have 

substantially greater sensitivity to said test compound. 
10 Another embodiment of the present invention is a purified or isolated nucleic acid 

comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93. 

Another embodiment of the present invention is a compound which interacts with a gene or 

gene product whose activity or expression is inhibited by an antisense nucleic acid comprising one 

of SEQ ID NOs.: 1-93 to inhibit proliferation. 
15 Another embodiment of the present invention is a compound which interacts with a 

polypeptide whose expression is inhibited by an antisense nucleic acid comprising one of SEQ ID 

NOs.: 1-93 to inhibit proliferation. 

Another embodiment of the present invention is a method for manufacturing an antibiotic 

comprising the steps of screening one or more candidate compounds to identify a compound that 
20 reduces the activity or level of a gene product required for proliferation, said gene product 

comprising a gene product whose activity or expression is inhibited by an antisense nucleic acid 

comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93 and 

manufacturing the compound so identified. 

The screening step may comprise performing any one of the methods of identifying a compound 

25 described above. 

Another embodiment of the present invention is a method for inhibiting proliferation of a 
microorganism in a subject comprising administering a compound that reduces the activity or level 
of a gene product required for proliferation of said microorganism, said gene product comprising a 
gene product whose activity or expression is inhibited by an antisense nucleic acid comprising a 

30 sequence selected from the group consisting of SEQ ED NOs.: 1-93 to said subject The method of 
subject may be selected from the group consisting of vertebrates, mammals, avians, and human 
beings. The gene product may comprise a polypeptide, comprising a sequence selected from the 
group consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 464-466, 468 
and 472-479. 

35 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is an IPTG dose response curve in R coli transformed with an IPTG-inducible 
plasmid containing either an antisense clone to the E. coli rplW gene (AS-rplW) which encodes a 
ribosomal protein required for protein synthesis and essential for cell proliferation, or an antisense 
5 clone to the elaD gene (AS-elaD) which is not known to be involved in protein synthesis and which 
is also essential for proliferation. 

Figure 2A is a tetracycline dose response curve in R coli transformed with an IPTG- 
inducible plasmid containing antisense to the rptW gene (AS-rplW) which was carried out in the 
presence of 0, 20 or 50 |iM IPTG. 
10 Figure 2B is a tetracycline dose response curve in E. coli transformed with an IPTG- 

inducible plasmid containing antisense to the elaD gene (AS-elaD) which was carried out in the 
presence of 0, 20 or 50 jiM IPTG. 

Figure 3 is a graph showing the fold increase in tetracycline sensitivity of R coli 
transfected with antisense clones to essential ribosomal protein genes L23 (AS-rplW) and L7/L12 
15 and L10 (AS-rplLrpU). Antisense clones to genes known not to be involved in protein synthesis 
(atpB/E(AS-atpB/E ), visC (AS-vwC), elaD (A$-elaD) 9 yohH (AS-yohH) are much less sensitive to 
tetracycline. 

Definitions 

20 By "biological pathway" is meant any discrete cell function or process that is carried out by 

a gene product or a subset of gene products. Biological pathways include enzymatic, biochemical 
and metabolic pathways as well as pathways involved in the production of cellular structures such 
as cell walls. Biological pathways that are usually required for proliferation of microorganisms 
include, but are not limited to, cell division, DNA synthesis and replication, RNA synthesis 

25 (transcription), protein synthesis (translation), protein processing, protein transport, fatty acid 
biosynthesis, cell wall synthesis, cell membrane production, synthesis and maintenance, and the 
like. 

By "inhibit activity of a gene or gene product" is meant having the ability to interfere with 
the function of a gene or gene product in such a way as to decrease expression of the gene or to 

30 reduce the level or activity of a product of the gene. Agents which inhibit the activity of a gene 
include agents that inhibit transcription of the gene, agents that inhibit processing of the transcript 
of the gene, agents that reduce the stability of the transcript of the gene, and agents that inhibit 
translation of the mRNA transcribed from the gene. In microorganisms, agents which inhibit the 
activity of a gene can act to decrease expression of the operon in which the gene resides or alter the 

35 folding or processing of operon RNA so as to reduce the level or activity of the gene product The 
gene product can be a non-translated RNA such as ribosomal RNA, a translated RNA (mRNA) or 
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the protein product resulting from translation of the gene mRNA. Of particular utility to the present 
invention are antisense RNAs that have activities against the operons or genes to which they 
specifically hybridze. 

By "activity against a gene product" is meant having the ability to inhibit the function or to 
5 reduce the level or activity of the gene product in a cell. 

By "activity against a protein" is meant having the ability to inhibit the function or to 
reduce the level or activity of the protein in a cell. 

By "activity against a nucleic acid" is meant having the ability to inhibit the function or to 
reduce the level or activity of the nucleic acid in a celL 
10 By "activity against a gene" is meant having the ability to inhibit the function or expression 

of the gene in a cell. 

By "activity against an bperon" is meant having the ability to inhibit the function or reduce 
the level of one or more products of the operon in a cell. 

By "antibiotic" is meant an agent which inhibits the proliferation of a microorganism. 

15 By "E. coli or Escherichia coli" is meant Escherichia coli or any organism previously 

categorized as a species of Shigella including Shigella boydii, Shigella flexneri, Shigella 
dysenteriae, Shigella sonnei, Shigella 2A. 

By "identifying a compound" is meant to screen one or more compounds in a collection of 
compounds such as a combinatorial chemical library or other library of chemical compounds or to 

20 characterize a single compound by testing the compound in a given assay and determining whether 
it exhibits the desired activity. 

By "inducer" is meant an agent or solution which, when placed in contact with a 
microorganism, increases transcription from a desired promoter. 

As used herein, "nucleic acid" means DNA, RNA, or modified nucleic acids. Thus, the 

25 terminology "the nucleic acid of SEQ ID NO: X" includes both the DNA sequence of SEQ ID NO: 
X and an RNA sequence in which the thymidines in the DNA sequence have been substituted with 
uridines in the RNA sequence and in which the deoxyribose backbone of the DNA sequence has 
been substituted with a ribose backbone in the RNA sequence. Modified nucleic acids are nucleic 
acids having nucleotides or structures which do not occur in nature, such as nucleic acids in which 

30 the internucleotide phosphate residues with methylphosphonates, phosphorothioales, 
phosphoramidates, and phosphate esters. Nonphosphate internucleotide analogs such as siloxane 
bridges, carbonate brides, thioester bridges, as well as many others known in the art may also be 
used in modified nucleic acids. Modified nucleic acids may also comprise, a-anomeric nucleotide 
units and modified nucleotides such as 1,2-dideoxy-d-ribofuranose, 1,2-dideoxy-l- 

35 phenylribofiiranose, and JV 4 , iV'-ethano-S-methyl-cytosine are contemplated for use in the present 
invention. Modified nucleic acids may also be peptide nucleic acids in which the entire 
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deoxyribose-phosphate backbone has been exchanged with a chemically completely different, but 
structurally homologous, polyamide (peptide) backbone containing 2-aminoethyl glycine units. 

As used herein, "sub-lethal*' means a concentration of an agent below the concentration 
required to inhibit all cell growth. 
5 DETAILED DESCRIPTION OF THE INVENTION 

The present invention describes a group of E. coli genes and gene families required for growth 
and/or proliferation. A proliferation-required gene or gene family is one where, in the absence of a 
gene transcript and/or gene product, growth or viability of the microorganism is reduced or eliminated. 
Thus, as used herein the terminology "proliferation-required" or "required for proliferation" 

10 encompasses instances where the absence of a gene transcript and/or gene product completely 
eliminates cell growth as well as instances where the absence of a gene transcript and/or gene product 
merely reduces cell growth. These proliferation-required genes can be used as potential targets for the 
generation of new antimicrobial agents. To achieve that goal, the present invention also encompasses 
novel assays for analyzing proliferation-required genes and for identifying compounds which interact 

15 with the gene products of the proliferation-required genes. In addition, the present invention 
contemplates the expression of genes and the purification of the proteins encoded by the nucleic acid 
sequences identified as required proliferation genes and reported herein. The purified proteins can be 
used to generate reagents and screen small molecule libraries or other candidate compound libraries for 
compounds that can be further developed to yield novel antimicrobial compounds. The present . 

20 invention also describes methods for identification of homologous genes or polypeptides in organisms 
other than E. colL 

The present invention utilizes a novel method to identify proliferation-required R coli 
sequences. Generally, a library of nucleic acid sequences from a given source are subcloned or 
otherwise inserted into an inducible expression vector, thus forming an expression library. Although 

25 the insert nucleic acids may be derived from the chromosome of the organism into which the 
expression vector is to be introduced, because the insert is not in its natural chromosomal location, the 
insert nucleic acid is an exogenous nucleic acid for the purposes of the discussion herein. The term 
expression is defined as the production of an RNA molecule from a gene, gene fragment, genomic 
fragment, or operon. Expression can also be used to refer to the process of peptide or polypeptide 

30 synthesis. An expression vector is defined as a vehicle by which a ribonucleic acid (RNA) sequence is 
transcribed from a nucleic acid sequence carried within the expression vehicle. The expression vector 
can also contain features that permit translation of a protein product from the transcribed RNA message 
expressed from the exogenous nucleic acid sequence carried by the expression vector. Accordingly, an 
expression vector can produce an RNA molecule as its sole product or the expression vector can 

3 5 produce a RNA molecule that is ultimately translated into a protein product 

Once generated, the expression library containing the exogenous nucleic acid sequences is 
introduced into an E. coli population to search for genes that are required for bacterial proliferation. 
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Because the library molecules are foreign to the population of E coli, the expression vectois and the 
nucleic acid segments contained therein are considered exogenous nucleic acid 

Expression of the exogenous nucleic acid fragments in the test population of K coli containing 
the expression vector library is then activated. Activation of the expression vectors consists of 
5 subjecting the cells containing the vectors to conditions that result in the expression of the exogenous 
nucleic acid sequences carried by the expression vector library. The test population of E. coli cells is 
then assayed to determine the effect of expressing the exogenous nucleic acid fragments on the test 
population of cells. Those expression vectors that, upon activation and expression, negatively impact 
the growth of the E coli screen population are identified, isolated, and purified for further study. 

10 A variety of assays are contemplated to identify nucleic acid sequences that negatively impact 

growth upon expression. In one embodiment, growth in E coli cultures expressing exogenous nucleic 
acid sequences is compared to growth in cultures not expressing these sequences. Optical density is 
used to monitor the extent of growth. Alternatively, enzymatic assays can be used to determine 
bacterial growth rates to identify exogenous nucleic acid sequences of interest. Colony size, colony 

15 morphology, and cell morphology are additional factors used to evaluate growth of the host cells. 
Those cultures that fail to grow or grow at a reduced rate under expression conditions are identified as 
containing an expression vector encoding a nucleic acid fragment that negatively affects a 
proliferation-required gene. 

Once exogenous nucleic acid sequences of interest are identified, they are analyzed. The first 

20 step of the analysis is to acquire the nucleic acid sequence of the nucleic acid fiagment of interest To 
achieve this end, the insert in those expression vectors identified as containing a sequence of interest is 
sequenced, using standard techniques well known in the art. The next step of the process is to 
determine the source of the nucleic acid sequence. 

Determination of sequence source is achieved by comparing the obtained sequence data with 

25 known sequences in various genetic databases. The sequences identified are used to probe these gene 
databases. The result of this procedure is a list of exogenous nucleic acid sequences corresponding to a 
list that includes novel bacterial genes required for proliferation as well as genes previously identified 
as required for proliferation. 

The number of DNA and protein sequences available in database systems has been growing 

30 exponentially for years. For example, at the end of 1998, the complete sequences of Caenorhabditis 
elegans, Saccharomyces cerevisiae and nineteen bacterial genomes, including E coli were available. 
This sequence information is stored in a number of databanks, such as GenBank (the National Center 
for Biotechnology Information (NCBI), and is publicly available for searching. 

A variety of computer programs are available to assist in the analysis of the sequences stored 

35 within these databases. FASTA, (W. R. Pearson (1990) "Rapid and Sensitive Sequence Comparison 
with FASTP and FASTA" Methods in Enzymology 183:63- 98), Sequence Retrieval System (SRS), 
(Etzold & Argos, SRS an indexing and retrieval tool for flat file data libraries. Comput Appl. 
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Biosci. 9:49-57, 1993) are two examples of computer programs that can be used to analyze 
sequences of interest In one embodiment of the present invention, the BLAST family of computer 
programs, which includes BLASTN version 2.0 with the default parameters, or BLASTX version 
2.0 with the default parameters, is used to analyze nucleic acid sequences. BLAST, an 

5 acronym for "Basic Local Alignment Search Tool," is a family of programs for database similarity 
searching. The BLAST family of programs includes: BLASTN, a nucleotide sequence database 
searching program, BLASTX, a protein database searching program where the input is a nucleic acid 
sequence; and BLASTP, a protein database searching program where die input is an amino acid 
sequence. BLAST programs embody a fast algorithm for sequence matching, rigorous statistical 

10 methods for judging the significance of matches, and various options for tailoring the program for 
special situations. Assistance in using the program can be obtained by e-mail at 
blastt@ncbi nlm tiih.pov. 

Bacterial genes are often transcribed in polycistronic groups. These groups comprise operons, 
which are a collection of genes and intergenic sequences. The genes of an operon are co-transcribed 

15 and often have related functions. Given the nature of the screening protocol, it is possible that the 
identified exogenous nucleic acid sequence corresponds to a gene or portion thereof with or without 
adjacent noncoding sequences, an intragenic sequence (i.e. a sequence within a gene), an intergenic 
sequence (i.e. a sequence between genes), a sequence spanning at least a portion of two or more genes, 
a 5' noncoding region or a 3 T noncoding region located upstream or downstream from the actual 

20 sequence that is required for bacterial proliferation. Accordingly, determining which gene(s)that is 
encoded within the operons is individually required for proliferation is often desirable. 

In one embodiment of the present invention, an operon is dissected to determine which gene or 
genes are required for proliferation. For example, the RegulonDB DataBase described by Huerta et al. 
(NucL Acids Res. 26:55-59, 1998), which may also be found on the website 

25 ht1p://ww.cifii.unam.mx/ComputationalJBiology/regulondb/4n^r be used to identify the boundaries 
of operons encoded within microbial genomes. A number of techniques that are well known in the art 
can then be used to dissect the operon. In one aspect of this embodiment, gene disruption by 
homologous recombination is used to individually inactivate the genes of an operon that is thought to 
contain a gene required for proliferation. 

30 Several gene disruption techniques have been described for the replacement of a functional 

gene with a mutated, non-functional (null) allele. These techniques generally involve the use of 
homologous recombination. The method described by Link et al. (J. Bacterid 1997 179:6228) 
serves as an excellent example of these methods as applicable to disruption of genes in K colL This 
technique uses crossover PCR to create a null allele with an in-frame deletion of the coding region 

35 of a target gene. The null allele is constructed in such a way that sequences adjacent to the wild 
type gene (ca. 500 bp) are retained. These homologous sequences surrounding the deletion null 
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allele provide targets for homologous recombination so that the wild type gene on the K coli 
chromosome can be replaced by the constructed null allele. 

The crossover PCR amplification product is subcloned into the vector pK03, the features of 
which include a chloramphenicol resistance gene, the counter-selectable marker sacB, and a 
5 temperature sensitive autonomous replication function. Following transformation of an K coli cell 
population with such a vector, selection for cells that have undergone homologous recombination of 
the vector into the chromosome is achieved by growth on chloramphenicol at the non-permissive 
temperature of 43°C. Under these conditions, autonomous replication of the plasmid cannot occur 
and cells are resistant to chloramphinicol only if the chloramphenicol resistance gene has been 

1 0 integrated into the chromosome. Usually a single crossover event is responsible for this integration 
event such that the E. coli chromosome now contains a tandem duplication of the target gene 
consisting of one wild type allele and one deletion null allele separated by vector sequence. 

This new E. coli strain containing the tandem duplication can be maintained at permissive 
temperatures in the presence of drug selection (chloramphenicol). Subsequently, cells of this new 

15 strain are cultured at the permissive temperature 30°C without drug selection. Under these 
conditions, the chromosome of some of the cells within the population will have undergone an 
internal homologous recombination event resulting in removal of the plasmid sequences. 
Subsequent culturing of the strain in growth medium lacking chloramphenicol but containing 
sucrose is used to select for such recombinative resolutions. In the presence of the counter- 

20 selectable marker sacB 9 sucrose is rendered into a toxic metabolite. Thus, cells that survive this 
counter-selection have lost both the plasmid sequences from the chromosome and the autonomously 
replicating plasmid that results as a byproduct of recombinative resolution. 

There are two possible outcomes of the above recombinative resolution via homologous 
recombination. Either the wild type copy of the targeted gene is retained on the chromosome or the 

25 mutated null allele is retained on the chromosome. In the case of an essential gene, a single copy of 
the null allele would be lethal and such cells should not be obtained by the above procedure when 
applied to essential genes. In the case of a non-essential gene, roughly equal numbers of cells 
containing null alleles and cells containing wild type alleles should be obtained. Thus, the method 
serves as a test for essentiality of the targeted gene: when applied to essential genes, only cells with 

30 a wild type allele on the chromosome will be obtained. 

Other techniques have also been described for the creation of disruption mutations in E; 
coli. For example, Link et aL also describe inserting an in-frame sequence tag concommitantly with 
an in-frame deletion in order to simplify analysis of recombinants obtained. Further, Link et al. 
describe disruption of genes with a drug resistance marker such as a kanamycin resistance gene. 

35 Arigoni et al., (Arigoni, F. et al. A Genome-based Approach for the Identification of Essential 
Bacterial Genes, Nature Biotechnology 16: 851-856) describe the use of gene disruption combined 
with engineering a second copy of a test gene such that the expression of the gene is regulated by 
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and inducible promoter such as the arabinose promoter to test the essentiality of the gene. Many of 
these techniques result in the insertion of large fragments of DNA into the gene of interest, such as 
a drug selection marker. An advantage of the technique described by Link et al. is that it does not 
rely on an insertion into the gene to cause a functional defect, but rather results in the precise 
5 removal of the coding region. This insures the lack of polar effects on the expression of genes 
downstream from the target gene. 

Recombinant DNA techniques can be used to express the entire coding sequences of the gene 
identified as required for proliferation, or portions thereof. The over-expressed proteins can be used as 
reagents for further study. The identified exogenous sequences are isolated, purified, and cloned into a 

10 suitable expression vector using methods well known in the art If desired, the nucleic acids can 
contain the sequences encoding a signal peptide to facilitate secretion of the expressed protein. 

Expression of fragments of the bacterial genes identified as required for proliferation is also 
contemplated by the present invention. The fragments of the identified genes can encode a polypeptide 
comprising at least 5, at least 10, at least 1 5, at least 20, at least 25, at least 30, at least 35, at least 40* at 

15 least 45, at least 50, at least 55, at least 60, at least 65, at least 75, or more than 75 consecutive amino 
acids of a gene complementary to one of the identified sequences of the present invention. The nucleic 
acids inserted into the expression vectors can also contain sequences upstream and downstream of the 
coding sequence. 

When expressing the coding sequence of an entire gene identified as required for bacterial 

20 proliferation or a fragment thereof, the nucleic acid sequence to be expressed is operably linked to a 
promoter in an expression vector using conventional cloning technology. The expression vector can be 
any of the bacterial, insect, yeast, or mammalian expression systems known in the art Commercially 
available vectors and expression systems are available from a variety of suppliers including Genetics 
Institute (Cambridge, MA), Stratagene (La Jolla, California), Promega (Madison, Wisconsin), and 

25 Invitrogen (San Diego, California). If desired, to enhance expression and facilitate proper protein 
folding, the cod on usage and codon bias of the sequence can be optimized for the particular expression 
organism in which the expression vector is introduced, as explained by Hatfield, et al., U.S. Patent No. 
5,082,767. Fusion protein expression systems are also contemplated by the present invention. 

Following expression of the protein encoded by the identified exogenous nucleic acid 

30 sequence, the protein is purified. Protein purification techniques are well known in the art. Proteins 
encoded and expressed from identified exogenous nucleic acid sequences can be partially purified 
using precipitation techniques, such as precipitation with polyethylene glycol. Alternatively, epitope 
tagging of the protein can be used to allow simple one step purification of the protein. 
Chromatographic methods usable with the present invention can include ion-exchange 

35 chromatography, gel filtration, use of hydroxyapaptite columns, immobilized reactive dyes, 
chromatofocusing, and use of high-performance liquid chromatography. Electrophoretic methods such 
one-dimensional gel electrophoresis, high-resolution two-dimensional polyaciylamide electrophoresis, 
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isoelectric focusing, and others are contemplated as purification methods. Also, affinity 
chromatographic methods, comprising antibody columns, ligand presenting columns and other affinity 
chromatographic matrices are contemplated as purification methods in the present invention. 

The purified proteins produced from the gene coding sequences identified as required for 
5 proliferation can be used in a variety of protocols to generate useful antimicrobial reagents. In one 
embodiment of the present invention, antibodies are generated against die proteins expressed from the 
identified exogenous nucleic acid sequences. Both monoclonal and polyclonal antibodies can be 
generated against the expressed proteins. Methods for generating monoclonal and polyclonal 
antibodies are well known in the art Also, antibody fragment preparations prepared from the produced 

1 0 antibodies discussed.above are contemplated 

In addition, die purified protein, fragments therof, or derivatives thereof may be administered 
to an individual in a pharmaceutically acceptable carrier to induce an immune response against the 
protein. Preferably, the immune response is a protective immune response which protects the 
individual. Methods for determining appropriate dosages of the protein and pharmaceutically 

1 5 acceptable carriers are familiar to those skilled in the art 

Another application for the purified proteins of the present invention is to screen small 
molecule libraries for candidate compounds active against the various target proteins of the present 
invention. Advances in the field of combinatorial chemistry provide methods, well known in the art, to 
produce large numbers of candidate compounds that can have a binding, or otherwise inhibitory effect 

20 on a target protein. Accordingly, the screening of small molecule libraries for compounds with binding 
affinity or inhibitory activity for a target protein produced from an identified gene sequence is 
contemplated by the present invention. 

The present invention further contemplates utility against a variety of other pathogenic 
organisms in addition to K coll For example, the invention has utility in identifying genes required for 

25 proliferation in prokaryotes and eukaryotes. For example, the invention has utility with protists, such 
as Plasmodium spp .and as Entamoeba spp.; plants; animals, such and Contracaecum spp; and fungi 
including Candida spp., ((e.g., Candida albicansCandida glabrata (also called Torulopsis glabrata), 
Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr 
(also called Candida pseudotropicalis), Candida dublmiensis,)) 9 Saccharomyces cerevisiae, 

30 Cryptococcvs neojbrmans, and Aspergillus fumigatus. In one embodiment of the present invention, 
monera, specifically bacteria are probed in search of novel gene sequences required for proliferation. 
This embodiment is particularly important given the rise of drug resistant bacteria. 

The numbers of bacterial species that are becoming resistant to existing antibiotics are 
growing. A partial list of these organisms includes: Staphylococcus spp., such as S. aureus; 

35 Enterococcus spp., such E. faecalis; Pseudomonas spp., such as P. aeruginosa, Clostridium spp., 
such as C. bottdinwn or C. difficile, Haemophilus spp., such as H. influenzae, Enterobacter spp., 
such as E. cloacae, Vibrio spp., such as V. cholera; Moraxala spp., such as M catarrhalis; 
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Streptococcus sppt, such as S. pneumoniae, Neisseria spp., such as N gonorrhoeae; Mycoplasma 
spp., such as Mycoplasma pneumoniae; Salmonella typhimurium; Helicobacter pylori; Escherichia 
coli; and Mycobacterium tuberculosis. The sequences identified as required for proliferation in the 
present invention can be used to probe these and other organisms to identify homologous required 
S proliferation genes contained therein. 

In one embodiment of the present invention, the nucleic acid sequences disclosed herein are 
used to screen genomic libraries generated from bacterial species of interest other than E. coli. For 
example, the genomic library may be from Aspergillus Jumigatus, Bacillus anthracis, Burkholderia 
cepacia, Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis 

10 glabrata), Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, 
Candida kefyr (also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia 
pneumoniae, Chlamydia trachomatis, Clostridium botulinum, Cryptococcus neoformans, 
Enterobacter cloacae, Enterococcus faecalis, Haemophilus influenzae, Helicobacter pylori, 
Klebsiella pneumoniae, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium 

15 tuberculosis, Neisseria gonorrhoeae, Pseudomonas aeruginosa, Salmonella cholerasuis, 
Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella typhimurium, 
Staphylococcus aureus, Klebsiella pneumoniae, Listeria monocytogenes, Moxarella catarrhalis, 
Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, Pseudomonas aeruginosa, 
Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema pallidum, Yersinia pestis or 

20 any species falling within the genera of any of the above species. Standard molecular biology 
techniques are used to generate genomic libraries from various microorganisms. In one aspect, the 
libraries are generated and bound to nitrocellulose paper. The identified exogenous nucleic acid 
sequences of the present invention can then be used as probes to screen the libraries for homologous 
sequences. The homologous sequences identified can then be used as targets for the identification 

25 of new, antimicrobial compounds with activity against more than one organism. 

For example, the preceding methods may be used to isolate nucleic acids having a sequence 
with at least 97%, at least 95%, at least 90%, at least 85%, at least 80%, or at least 70% identity to a 
nucleic acid sequence selected from the group consisting of one of the sequences of SEQ ID NOS. 
1-93, 106-112, 119-122, 134-160, 164-171, 179-265, 271-273, 275, and 279-286, fragments 

30 comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 consecutive 
nucleotides thereof, and the sequences complementary thereto. Identity may be measured using 
BLASTN version 2.0 with the default parameters. (Altschul, S.F. et al. Gapped BLAST and PSI- 
BLAST: A New Generation of Protein Database Search Programs, Nucleic Acid Res. 25: 3389- 
3402 (1997)). For example, the homologous polynucleotides may have a coding sequence which is 

35 a naturally occurring allelic variant of one of the coding sequences described herein. Such allelic 
variants may have a substitution, deletion or addition of one or more nucleotides when compared to 
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the nucleic acids of SEQ ID NOs: 1-93, 106-112, 119-122, 134-160, 164-171, 179-265, 271-273, 
275, and 279-286 or the sequences complementary thereto. 

Additionally, the above procedures may be used to isolate nucleic acids which encode 
polypeptides having at least 99%, 95%, at least 90%, at least 85%, at least 80%, at least 70%, at 
5 least 60%, at least 50%, or at least 40% identity or similarity to a polypeptide having the sequence 
of one of SEQ ID NOs: 299-305, 312-315, 327-353, 357-364, 372-458, 464-466, 468 and 472-479 
or to a polypeptide whose expression is inhibited by a nucleic acid of one of SEQ ID NOs.: 1-93, or 
fiagments comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino 
acids of the preceding polypeptides as determined using the FASTA version 3.0t78 algorithm with 

10 the default parameters. Alternatively, protein identity or similarity may be identified using 
BLASTP with the default parameters, BLASTX with the default parameters, or TBLASTN with the 
default parameters. (Alschul, S.F. et al. Gapped BLAST and PSI-BLAST: A New Generation of 
Protein Database Search Programs, Nucleic Acid Res. 25: 3389-3402 (1997)). 

Alternatively, homologous nucleic acids or polypeptides may be identified by searching a 

15 database to identify sequences having a desired level of homology to a nucleic acid or a polypeptide 
involved in proliferation or an antisense nucleic acid to a nucleic acid involved in microbial 
proliferation. A variety of such databases are available to those skilled in the art, including 
GenBank and GenSeq. In some embodiments, the databases are screened to identify nucleic acids 
or polypeptides having at least 97%, at least 95%, at least 90%, at least 85%, at least 80%, at least 

20 70%, at least 60%, or at least 50%, at least 40% identity or similarity to a nucleic acid or 
polypeptide involved in proliferation or an antisense nucleic acid involved in proliferation. For 
example, the database may be screened to identify nucleic acids homologous to one of SEQ ID Nos. 
1-93, 106-112, 119-122, 134-160, 164-171, 179-265, 271-273, 275, and 279-286, homologous to 
fragments comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 400, or 500 

25 consecutive nucleotides thereof, or homologous to the sequences complementary to any of the 
preceding nucleic acids. In other embodiments, the databases are screened to indetify polypeptides 
having at least 99%, 95%, 90%, 855, 80%, 70%, 60%, 50%, 40%, or at least 25% identity or 
similarity ot a polypeptide involved in proliferation or a portion thereof. For example, the database 
may be screened to identify polypeptides homologous to a polypeptide comprising one of SEQ ID 

30 NOs. 299-305, 312-315, 327-353, 357-364, 372^58, 464-466, 468 and 472-479, a polypeptide 
whose expression is inhibited by a nucleic acid of one of SEQ ED NOs: 1-93, or homologous to 
fragments comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino 
acids of any of the preceding polypeptides. In some embodiments, the database may be screened to 
identify homologous nucleic acids or polypeptides from organisms otherthan 2?. coli, including 

35 organisms such as Aspergillus fumigates, Bacillus anthracis, Burkholderia cepacia, 
Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), 
Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kejyr 
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(also called Candida pseudotropicalis), Candida dubliniensis, Candida glabrata (also called 
Tortdopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida 
hrusei, Candida hefyr (also called Candida pseudotropicalis), Candida dubliniensis, Candida 
glabrata (also called Tortdopsis glabrata), Candida tropicalis, Candida parapsilosis, Candida 
5 guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), Candida 
dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatns, Clostridium botulmum, Clostridium 
difficile, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus faecalis, Haemophilus 
influenzae, Helicobacter pylori, Klebsiella pneumoniae, Listeria monocytogenes, Mycobacterium 
leprae, Mycobacterium tuberculosis, Neisseria gonorrhoeae, Pseudomonas aeruginosa, Salmonella 

10 cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Staphylococcus aureus, Klebsiella pneumoniae, Listeria monocytogenes, Moxarella 
catarrhalis, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella somtei, Pseudomonas 
aeruginosa, Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema pallidum, 
Yersinia pestis or any species falling within the genera of any of the above species. 

15 In another embodiment, gene expression arrays and microarrays can be employed. Gene 

expression arrays are high density arrays of DNA samples deposited at specific locations on a glass 
chip, nylon membrane, or the like. Such arrays can be used by researchers to quantify relative gene 
expression under different conditions. Gene expression arrays are used by researchers to help 
identify optimal drug targets, profile new compounds, and determine disease pathways. An 

20 example of this technology is found in U.S. Patent No. 5807522. 

It is possible to study the expression of all genes in the genome of a particular microbial 
organism using a single array. For example, the arrays from Genosys consist of 12 x 24 cm nylon 
filters containing PCR products corresponding to 4290 ORFs from E. coli. 10 ngs of each are 
spotted every 1.5 mm on the filter. Single stranded labeled cDNAs are prepared for hybridization 

25 to the array (no second strand synthesis or amplification step is done) and placed in contact with the 
filter. Thus the labeled cDNAs are of "antisense" orientation. Quantitative analysis is done by 
phosphorimager. 

Hybridization of cDNA made from a sample of total cell mRNA to such an array followed 
by detection of binding by one or more of various techniques known to those in the art results in a 

30 signal at each location on the array to which cDNA hybridized. The intensity of the hybridization 
signal obtained at each location in the array thus reflects the amount of mRNA for that specific 
gene that was present in the sample. Comparing the results obtained for mRNA isolated from cells 
grown under different conditions thus allows for a comparison of the relative amount of expression 
of each individual gene during growth under the different conditions. 

35 Gene expression arrays may be used to analyze the total mRNA expression pattern at 

various time points after induction of an antisense nucleic acid complementary to a proliferation- 
required gene. Analysis of the expression pattern indicated by hybridization to the array provides 
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information on whether or not the target gene of the antisense nucleic acid is being affected by 
antisense induction, how quickly the antisense is affecting the target gene, and for later timepoints, 
what other genes are affected by antisense expression. For example, if the antisense is directed 
against a gene for ribosomal protein L7/L12 in the SOS subunit, its targeted mRNA may disappear 
5 first and then other mRNAs may be observed to increase, decrease or stay the same. Similarly, if 
the antisense is directed against a different SOS subunit ribosomal protein mRNA (e.g. L25), that 
mRNA may disappear first followed by changes in mRNA expression that are similar to those seen 
with the L7/L12 antisense expression. Thus, the mRNA expression pattern observed with an 
antinsense nucleic acid complementary to a proliferation required gene may identify other 

10 proliferation-required nucleic acids in the same pathway as the target of the antisense nucleic acid. 
In addition, the mRNA expression patterns observed with candidate drug compounds may be 
compared to those observed with antisense nucleic acids against a proliferation-required nucleic 
acid. If the mRNA expression pattern observed with the candidate drug compound is similar to that 
observed with the antisense nucleic acid, the drug compound may be a promising therapeutic 

IS candidate. Thus, the assay would be useful in assisting in the selection of candidate drug 
compounds for use in screening methods such as those described below. 

In cases where the source of nucleic acid deposited on the array and the source of the 
nucleic acid being hybridized to the array are from two different organisms, gene expression arrays 
can identify homologous genes in the two organisms. 

20 The present invention also contemplates additional methods for screening other . 

microorganisms for proliferation-required genes. In this embodiment, the conserved portions of 
sequences identified as proliferation-required can be used to generate degenerate primers for use in the 
polymerase chain reaction (PCR). The PCR technique is well known in the art The successful 
production of a PCR product using degenerate probes generated from the sequences identified herein 

25 would indicate the presence of a homologous gene sequence in the species being screened. This 
homologous gene is then isolated, expressed, and used as a target for candidate antibiotic compounds. 
In another aspect of this embodiment, the homologous gene is expressed in an autologous organism or 
in a heterologous organism in such a way as to alter the level or activity of a homologous gene required 
for proliferation in the autologous or heterologus organism. In still another aspect of this embodiment, 

30 the homologous gene or portion is expressed in an antisense orientation in such a way as to alter the 
level or activity of a nucleic acid required for proliferation of an autologous or heterologous organism. 

The homologous sequences to proliferation-required genes identified using the techniques 
described herein may be used to identify proliferation-required genes of organisms other than £ 
coli, to inhibit the proliferation of organisms other than £ coli by inhibiting the activity or reducing 

35 the amount of the identified homologous nucleic acid or polypeptide in the organism other than £ 
coli, or to identify compounds which inhibit the growth of organisms other than £ coli as described 
below. 
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In another embodiment of the present invention, K coli sequences identified as required for 
proliferation are transferred to expression vectors capable of function within non-is coli species. As 
would be appreciated by one of ordinary skill in the art, expression vectors must contain certain 
elements that are species specific. These elements can include promoter sequences, operator 
5 sequences, repressor genes, origins of replication,selectable marker genes, ribosomal binding 
sequences, termination sequences, and others. To use the identified exogenous sequences of the 
present invention, one of ordinary skill in the art would know to use standard molecular biology 
techniques to isolate vectors containing the sequences of interest from cultured bacterial cells, 
isolate and purify those sequences, and subclone those sequences into an expression vector adapted 
10 for use in die species of bacteria to be screened. 

Expression vectors for a variety of other species are known in the art For example, Cao et 
al. report the expression of steroid receptor fragments in Staphylococcus aureus. J. Steroid Biochem 
Mol Biol. 44(1): 1-1 1 (1993). Also, Pla et al. have reported an expression vector that is functional 

in a number of relevant hosts including: Salmonella typhimurium, Pseudomonas putida, and 

■> 

15 Pseudomonas aeruginosa. J. Bacteriol. 172(8):4448-55 (1990). These examples demonstrate the 
existence of molecular biology techniques capable of constructing expression vectors for the 
species of bacteria of interest to Hie present invention. 

Following the subcloning of the identified nucleic acid sequences into an expression vector 
functional in the microorganism of interest, the identified nucleic acid sequences are conditionally 

20 transcribed to assay for bacterial growth inhibition. Those expression vectors found to contain 
sequences that, when transcribed, inhibit bacterial growth are compared to the known genomic 
sequence of the pathogenic microorganism being screened or, if the homologous sequence from the 
organism being screened is not known, it may be identified and isolated by hybridization to the 
proliferation-required E. coli sequence interest or by amplification using primers based on the 

25 proliferation-required E. coli sequence of interest as described above. 

The antisense sequences from the second organism which are identified as described above 
may then be operably linked to a promoter, such as an inducible promoter, and introduced into the 
second organism. The techniques described herein for identifying £. coli genes required for 
proliferation may thus be employed to determine whether the identified sequences from a second 

30 organism inhibit the proliferation of the second organism. 

Antisense nucleic acids required for the proliferation of organisms other than E. coli or the 
genes corresponding thereto, may also be hybridized to a microarray containing the E. coli OKFs to 
gauge the homology between the E. coli sequences and the proliferation-required nucleic acids from 
other organisms. For example, die proliferation-required nucleic acid may be from Aspergillus 

35 fumigatus, Bacillus anthracis, Burkholderia cepacia, Campylobacter jejuni, Candida albicans, 
Candida glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida parapsilosis, 
Candida guilliermondii, Candida hrusei, Candida kejyr (also called Candida pseudotropicalis), 
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Candida dublmiensis, Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium botulinum, 
Clostridium difficile, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus faecalis, 
Haemophilus influenzae, Helicobacter pylori, Klebsiella pneumoniae, Listeria monocytogenes, 
Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria gonorrhoeae, Pseudomonas 
5 aeruginosa, Salmonella cholerastds, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, 
Salmonella typhimurrum, Staphylococcus aureus, Klebsiella pneumoniae, Listeria monocytogenes, 
Moxarella catarrhalis, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sormei, 
Pseudomonas aeruginosa, Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema 
pallidum, Yersinia pestis or any species falling within the genera of any of the above species. The 
10 proliferation-required nucleic acids from an organism other than E coli may be hybridized to the 
array under a variety of conditions which permit hybridization to occur when the probe has 
different levels of homology to the sequence on the microarray. This would provide an indication 
of homology across the organisms as well as clues to other possible essential genes in these 
organisms. 

IS In still another embodiment, the exogenous nucleic acid sequences of the present invention 

that inhibit bacterial growth or proliferation can be used as antisense therapeutics for killing bacteria. 
The antisense sequences can be complementary to the proliferation-required genes whose sequence 
corresponds to the exogenous nucleic acid probes identified here (i.e. the antisense nucleic acid may 
hybridize to the gene or a portion thereof). Alternatively, antisense therapeutics can be complementary 

20 to operons in which proliferation-required genes reside (i.e. the antisense nucleic acid may hybridize to 
any gene in the operon in which the proliferation-required genes reside). Further, antisense 
therapeutics can be complementary to a proliferation-required gene or portion thereof with or without 
adjacent noncoding sequences, an intragenic sequence (i.e. a sequence within a gene), an intergenic 
sequence (Le. a sequence between genes), a sequence spanning at least a portion of two or more genes, 

25 a 5' noncoding region or a 3' noncoding region located upstream or downstream from the actual 
sequence that is required for bacterial proliferation or an operon containing a proliferation-required 
gene. 

In addition to therapeutic applications, the present invention encompasses the use of nucleic 
acid sequences complementary to sequences required for proliferation as diagnostic tools. For 

30 example, nucleic acid probes complementary to proliferation-required sequences that are specific for 
particular species of microorganisms can be used as probes to identify particular microorganism 
species in clinical specimens. This utility provides a rapid and dependable method by which to identify 
die causative agent or agents of a bacterial infection. This utility would provide clinicians the ability to 
prescribe species specific antimicrobial compounds to treat such infections. In an extension of this 

35 utility, antibodies generated against proteins translated from mRNA transcribed from proliferation- 
required sequences can also be used to screen for specific microorganisms that produce such proteins in 
a species-specific manner. 
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The following examples teach the genes of the present invention and a subset of uses for the K 
coli genes identified as required for proliferation. These examples are illustrative only and are not 
intended to limit the scope of the present invention. 

EXAMPLES 

5 The following examples are directed to the identification and exploitation of E. coli genes 

required few proliferation. Methods of gene identification are discussed as well as a variety of methods 
to utilize die identified sequences. 

Genes Identified as Required for Proliferation off. coli 

10 Exogenous nucleic acid sequences were cloned into an inducible expression vector and 

assayed for growth inhibition activity. Example 1 describes the examination of a library of 
exogenous nucleic acid sequences cloned into the IPTG-inducible expression vector pLEXSBA 
(Krause et al., J. Mol. Biol. 274: 365 (1997)) or a modified version of pLEX5BA, pLEX5BA-3' in 
which a synthetic linker containing a T7 terminator was ligated between the Pstl and HindUI sites 

15 of pLEXSBA. In particular, to construct pLEX5BA-3% the following oligonucleotides were 
annealed and inserted into the Pstl and HindlQ sites of pLEXSBA: 

5 » -GTCTAGCATAACCCCTTGGGGCCTCT^ 3 » (SEQ ID 

NO: 480) 

20 5 ' - AG CTT CAAAAAAC C CCTCAAGGACC CGTTTAGAGGCC C CAAGGGGTTAT 
GCTAGACTGCA- 3 1 (SEQ ID NO: 481) 

Random fragments of E. coli genomic DNA were generated by DNAsel digestion 
or sonication, filled in with T4 polymerase, and cloned into the Smal site of pLEX5BA or 
pLEX5BA-3\ Upon activation or induction, the expression vectors produced an RNA molecule 
corresponding to the subcloned exogenous nucleic acid sequences. The RNA product was in an 
antisense orientation with respect to the E. coli genes from which it was originally derived. This 
antisense RNA then interacted with sense mRNA produced from various E. coli genes and 
interfered with or inhibited the translation of the sense messenger RNA (mRNA) thus preventing 
protein production from these sense mRNA molecules. In cases where the sense mRNA encoded a 
protein required for the proliferation, bacterial cells containing an activated expression vector failed 
to grow or grew at a substantially reduced rate. Similar results have also been obtained in cases 
where the gene encodes a non-translated RNA, such as a ribosomal RNA. 

It will be appreciated that vectors other than pLEXSBA or pLEX5BA-3' may be used to 
transcribe the genomic DNA inserts. In addition, it will be appreciated that, if desired, pLEX5BA 
or pLEX5BA-3' may be modified to introduce features such as stop codons in all three reading 
frames downstream of the genomic DNA inserts to ensure that if the genomic DNA insert encodes a 
polypeptide (i.e. the insert is in the sense orientation rather than the antisense orientation or the 
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insert is in the antisense orientation but contains a cryptic ORF) translation of the polypeptide will 
terminate shortly after die genomic insert 

EXAMPLE 1 
Inhibition of Bacterial Proliferation after IPTG induction 
5 To study the effects of transcriptional induction in liquid medium, growth curves were carried 

out by back diluting cultures 1:200 into fresh media with or without 1 mM IPTG and measuring the 
OD450 every 30 minutes (min). To study the effects of transcriptional induction on solid medium, 10 2 , 
10 3 , 10 4 , 10 5 , 10 6 , 10 7 and 10 8 fold dilutions of overnight cultures were prepared. Aliquots of from 0.5 
to 3 fil of these dilutions were spotted on selective agar plates with or without 1 mM IPTG. After 
1 0 overnight incubation, the plates were compared to assess the sensitivity of the clones to IPTG. 

Of the numerous clones tested, some clones were identified as containing a sequence that 
inhibited K coli growth after IPTG induction. Accordingly, the gene to which the inserted nucleic acid 
sequence corresponds, or a gene within the operon containing the inserted nucleic acid, may be 
required for proliferation in K colL 
1 5 Characterization of Isolated Clones Negatively Affecting is. coli Proliferation 

Following the identification of those inserts that, upon expression, negatively impacted E. coli 
growth or proliferation, the inserts were isolated and subjected to nucleic acid sequence determination. 

EXAMPLE 2 

Nucleic Acid Sequence Determination of Identified Clones Expressing Nucleic Acid Fragments with 
20 Detrimental Effects of K coli Proliferation 

The nucleotide sequences for the exogenous identified sequences were determined using 
plasmid DNA isolated using QIAPREP (Qiagen, Valencia, CA) and methods supplied by the 
manufacturer. The primers used for sequencing the inserts were 5' - TGTTTATC AG ACCGCTT - V 
(SEQ ID NO: 1) and 5' - AC AATTTC AC AC AGCCTC - 3' (SEQ ID NO: 2). These sequences flank 
25 the polylinker in pLEX5BA. Sequence identification numbers (SEQ ID NOs) for the identified inserts 
are listed in Table I and discussed below. 

EXAMPLE 3 

Comparison Of Isolated Sequences to Known Sequences 
The nucleic acid sequences of the subcloned fragments obtained from the expression vectors 

30 discussed above were compared to known £. coli sequences in GenBank using BLAST version 1.4 or 
version 2.0.6 using the following default parameters: Filtering off, cost to open a gap=5, cost to extend 
a gap=2, penalty for a mismatch in the blast portion of run—3, reward for a match in the blast portion 
of run=l, expectation value (e)=10.0, word sizep=ll, number of one-line descriptions=100, number of 
alignments to show (B)=I00. BLAST is described in Altschui, J Mol Biol. 215:403-10 (1990). 

35 Expression vectors were found to contain nucleic acid sequences in both the sense and antisense 
orientations. The presence of known genes, open reading frames, and ribosome binding sites was 
determined by comparison to public databases holding genetic information and various computer 
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programs such as the Genetics Computer Group programs FRAMES and CODONPREFERENCE. 
Clones were designated as "antisense" if the cloned fragment was oriented to the promoter such that 
the KNA transcript produced was complementary to the expressed mRNA (or non-translated RNA) 
from a chromosomal locus. Clones were designated as "sense" if they coded for an RNA fragment that 
5 was identical to a portion of a wild type mRNA from a chromosomal locus. 

The sequences described in Examples 1-2 that inhibited bacterial proliferation and contained 
gene fragments in an antisense orientation are listed in Table 1. This table lists each identified sequence 
by: a sequence identification number, a Molecule Number, a gene to which the identified sequence 
corresponds, listed according to die National Center for Biotechnology Information (NCBI), Blattner 
10 (Science 277:1453-1474(1997); also contains the E. colt K-12 genome sequence), or Rudd (Micro, 
and MoL Rev. 62:985-1019 (1998)), nomenclatures. The CONT1G numbers for each identified 
sequence is shown, as well as the location of the first and last base pairs located on the E. coli 
chromosome. A Molecule Number with a "**" indicates a clone corresponding to an intergenic 
sequence. 

15 

TABLE I 

Identified Clones with Corresponding Genes and Operons 



SeqID 


Molecule 
Number 


Clone name 


Contig 


start 


stop 


Gene name 
(Blattner) 


Gene name 
(NCBI) 


1 


EcXA118a 


E1M10000131C06 


AE000408 


5299 


5440 


b3310 


rpIN 


2 


EcXA118b 


E1M10000152F04 


AE000408 


5300 


5452 


b3310 


iplN 


3 


EcXA118c 


E1M10000152H04 


AE000408 


5311 


5450 


b3310 


iplN 


4 


EcXA118d 


E1M10000153H03 


AE000408 


5299 


5475 


b3310 


rpIN 


5 


EcXA119 


E1M10000129F10 


AE000372 


2407 


2153 


b2883 


b2883 


6 


EcXA120 


E1M10000129G04 


AE000248 


2494 


2888 


bl509 


bl509 


7 


EcXA121 


E1M10000119D03 


AE000491 


1722 


2001 


b4191 




8 


EcXA122a 


1029-M5 


AE000300 


232 


1 


b2l08 


yehA 








AE000299 


10036 


9848 


b2107 


b2107 


9 


EcXA122b 


X3S208-17 


AE000300 


186 


1 


b2108 


yehA ! 








AE000299 


10036 


9870 


b2107 


b2107 


10 


EcXA122c 


E1M10000124C07 


AE000299 


10036 


9896 


b2107 


b2107 








AE000300 


78 


1 


b2108 


yehA 


11 


EcXA122d 


E1M10000125D06 


AE000300 


203 


1 


b2108 


yehA 








AE000299 


10036 


9892 


b2107 


b2107 


12 


EcXA123 


E1M10000106A08 


AE000410 


7832 


7968 


b3343 


yheL 


13 


EcXA124 


E1M10000106D02 


AE000408 


5951 


6225 


b3311 


rpsQ 








AE000408 


5951 


6225 


b3312 5 


rpmC 


14 


EcXA125 


E1M10000124D09 


AE000465 


7071 


7309 


b3901 


yiiL 


15 


EcXA126 


E1M10000124E02 


AE000441 


9429 


9316 


b3644 


yicC 


16 


EcXA127a 


E1M10000124E06 


AE000369 


1145 


856 


b2851 


b2851 


17 


EcXA127b 


E1M10000159A11 


AE000369 


1145 


763 


b2851 


b2851 


18 


EcXA128 


E1M10000124G04 


AE000171 


6676 


6427 


b0679 


nagE 


1 19 


EcXA129 


E1M10000127C09 


AE000123 


1270 


1506 


b0135, b0136 


yadC, yadK 


20 


EcXA130 


E1M10000146E05 


AE000450 


9592 


9828 


b3737, b3738 


atpE, atpB 


21 


EcXA131 


E1M10000162C01 


AE000383 


11211 


11350 


b3016 


b3016 
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SeqID 


Molecule 
Number 


Clone name 


Contig 


start 


stop 


Gene name 
(Blattner) 


Gene name 
(NCBI) 


22 


EcXA132 


E1M10000163A04 


AE000243 


5440 


5650 


bl465 


narV 


23 


EcXA133a 


E1M10000120F06 


AE000408 


5273 


5425 


b3309,b3310 


rplX, rplN 


24 


EcXA133b 


E1M10000132H04 


AE000408 


5263 


5422 


b3309,b3310 


rplX,rplN 


25 


EcXA133c 


E1M10000172C05 


AE000408 


5190 


5433 


b3309,b3310 


rplX, rplN 


26 


EcXA134 


E1M10000162H06 


AE000402 


4689 


4795 


b3229 


sspA 


27 


EcXA136a 


E1M10000147B03 


AE000373 


1 


251 


b2892 


red 








AE000372 


11967 


12144 


b2891 


prfB 


28 


EcXA136b 


E1M10000155F04 


AE000373 


1 


372 


b2892 


red 








AE000372 


12002 


12144 


b2891 


prfB 


29 


EcXA137 


EIM10000142F12 


AE000132 


9172 


8451 


b0243, b0244 


proA, thrW 


30 


EcXA138 


E1M10000148B09 


AE000393 


10354 


10051 


b3121 


yhaC 


31 


EcXA139 


E1M10000143A12 


AE000189 


3600 


4024 


b0874 


b0874 


32 


EcXA139b 


E1M10000169H02 


AE000189 


3646 


4026 


b0874 


b0874 


33 


EcXA140a 


E1M10000143C10 


AE000445 


1 


260 


b3672 


b3672 








AE000444 


12573 


12697 


b3671 


ilvB 


34 


EcXAHOb 


E1M10000151F10 


AE000445 


1 


378 


b3672 


b3672 








AE000444 


12576 


12697 


b3671 


ilvB 


35 


EcXA141 


E1M10000143G09 


AE000294 


9227 


9498 


b2035 


rfc 


36 


EcXA142 


E1M10000159C10 


AE000408 


1 


199 


b3299 


rpmJ 








AE000407 


10488 


10601 


b3298 


rpsM 


37 


EcXA143 


E1M10000159F03 


AE000368 


7705 


7317 


b2846 


b2846 


38 


EcXA144a 


E1M10000160A03 


AE000257 


545 


368 


b!613 


manA 


39 j 


EcXA144b 


E1M10000132F02 


AE000257 


585 


395 


b!613 


manA 


40 


EcXA144c 


E1M10000153E04 


AE000257 


552 


395 


b!613 


manA 


41 


EcXA145 


E1M10000160A06 


AE000366 


5350 


5533 


b2827 


thyA 


42 


EcXA146 


E1M10000160C02 


AE000449 


9694 


9815 


b3725 


pstB 


43 


EcXA147 


E1M10000160C03 


AE000211 


6964 


7126 


Mill 


bl 111 


44 


EcXA148 


E1M10000160E10 


AE000455 


10737 


10577 


b3794 


rffM 


45 


EcXA149a 


E1M10000160F11 


AE000374 


7930 


7680 


b2912 


yefA 


46 


EcXA149b 


E1M10000153F05 


AE000374 


7910 


7625 


b2912 


yefA 


47 


EcXA150 


E1M10000160H03 


AE000442 


5556 


5377 


b3650 


spoT 

Jr 


48 


EcXA151 


E1M10000144E03 


AE000429 


8099 


8352 


b3528 


dctA 


49 


EcXA152 


E1M10000144G07 


AE000308 


1086 


1385 


b2182 


bcr 


50 


EcXA153 ' 


E1M10000144B01 


AE000477 


4261 


4055 


b4038 


yjbl 


51 


EcXA154 


E1M10000150E02 


AE000290 


5618 


5406 


bl983 


b!983 


52 j 


EcXA155 


E1M10000150E06 


AE000339 


116 


496 


b2522 


sseB 


53 


EcXA156 


E1M10000156B08 


AE000463 


2852 


2699 


b3874 


yihN 


54 


EcXA157 


E1M10000156D07 


AE000408 


7274 


7464 


b3314,b3315 


rpsC, rplV 


.55 


EcXA158 


E1M10OOO156G12 


AE000338 


5181 


5370 


b2519 


b2519 


56 


EcXA159 


E1M10000151G10 


AE000154 


7217 


7618 


b0482 


b0482 


57 


EcXA160 


E1M1000O166GO6 


AE000269 


4488 


5074 


M744 


bl744 


58 


EcXA161 


869.A23 


AE000146 


3702 


3950 


b0398 


sbcD 


59 


EcXA162 


E1M10000112F05 


AE000497 


7921 


8156 


b4267 


yjgv 


60 


EcXA163 


E1M10000118B05 


AE000447 


8832 


8741 


b3705 


yidC 


61 


EcXA164 


E1M10000118C04 


AE000314 


6749 


6688 


b2243 


glpC 


62 


EcXA165 


E1M10000118C05 


AE000170 


219 


290 


b0655 


ybeJ 


63 


EcXA166 


E1M100001 1SG06 


AE00O461 


5835 


5770 


b3S59 


yihE 


64 


EcXA167 


E1M1 00001 19A05 j 


AE000298 


9388 


9521 


b2093 


gatB 


65 


EcXA168 


E1M10000123E09 


AE000267 


4622 


4413 


bl721 


b!721 


66 


EcXA169 


E1M10000123F11 


AE000408 


2650 


2767 


b3303 


rpsE 


67 


EcXA170 


E1M10000129G01 


AE000143 


3523 


3913 


b0363 


b0363 


68 


EcXA171 


E1M10000132G08 


AE000189 


6799 


6699 


b0876 


b0876 


! 69 


EcXA172 


E1M10000139B11 


AE000323 


3697 


3531 


b2345 


b2345 


70 


EcXA173 


E1M10000153A09 


AE000424 


6886 


6968 


b3485 


yhhJ 
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SeqID 


Molecule 
Number 


Clone name 


Contig 


start 


stop 


Gene name 
(Blattner) 


Gene name 
(NCBI) 


71 


EcXA174 


E1M10000153C04 


AE000377 


8932 


8832 


b2945 


endA 


72 


EcXA175 


E1M10000154F03 


AE000279 


7957 


7837 


bl854 


pykA 


73 


EcXA176 


E1M10000157B02 


AE000485 


4932 


5094 


M125 


XjdH 


74 


EcXA177 


E1M10000160H05 


AE000326 


3140 


3035 


b2378 


b2378 


75 


EcXA178 


E1M10000163H01 


AE000113 


1 


266 


b0023 


rpsT 


76 


EcXA179 


E1M10000164G04 


AE000262 


233 


1 


bl665 


vaTV 


77 


EcXA180 


E1M10000168F02 


AE000480 


3110 


3350 


b4066 


yjcF 


78 


EcXA181 


EIM10000169D08 


AE000469 


10535 


10604 


b3956 


ppc 


79 


EcXA182 


E1M10000169H07 


AE000376 


10447 


10586 


b2935 


tktA 


80 


EcXA183 


E1M10000169H08 


AE000118 


8416 


8228 


b0084 


ftsl 


81 


EcXA184 


E1M10000172C01 


AE000358 


8498 


8844 


b2750 


cysC 


! 82 


EcXA185 


E1M10000176C01 


AE000407 


10053 


10119 


b3297 


rpsK 


83 


EcXA186 


E1M10000176F01 


AE000135 


987 


812 


b0270 


yagG 


84 


EcXA187 


P319-3.M16 


AE000317 


2279 


1872 


b2271 


b2271 


85 


EcXA188 


P347.2 


AE000211 


7845 


8005 


b!113 


M113 


; 86 


EcXA189a 


E1M10000168A04 


AE000277 


2048 


1861 


M828 


bl828 


87 


EcXA189b 


E1M10000125F09 


AE000277 


2042 


1868 


bl828 


bl828 


88 


EcXA189c { 


E1M10000113F02 


AE000277 


2050 


1857 


bl828 


bl828 


89 


EcXA190a 


E1M10000161D04 


AE000387 


49 


312 


b3052 


b3052 


90 


EcXA190b 


E1M10000155B11 


AE000387 


49 


285 


b3052 


b3052 


91 


EcXA191a 


E1M10000167H09 


AE000451 


18 


220 


b3740 


gidB 


92 


EcXA191b ; 


E1M10000120H05 


AE000451 


1 


188 


b3740 


gidB 


93 


EcXA192 


E1M10000154H07 


AE000452 


3326 


3106 


b3750,b3751 


rbsC, rpsB 



EXAMPLE 4 

Identification of Genes and their Corresponding Operons Affected by Antisense Inhibition 
5 The sequencing of the entire K coli genome is described in Blattner et al., Science 277: 1453- 

1474(1997) and the sequence of the genome is listed in GenBank Accession No.U00096. The 
operons to which the proliferation- inhibiting nucleic acids correspond were identified using 
RegulonDB and information in the literature. The coordinates of the boundaries of these operons on 
the K coli genome are listed in Table IDL Table II lists the molecule numbers of the inserts containing 

1 0 the growth inhibiting nucleic acid fragments, the genes in the operons corresponding to the inserts, the 
SEQ ID NOs of the genes containing the inserts, the SEQ ID NOs of the proteins encoded by the 
genes, the start and stop points of the genes on the K coli genome, the orientation of die genes on the 
genome, whether the operons are predicted or documented, and the predicted functions of the genes. 
The identified operons, their putative functions, and whether or not the genes are presently thought to 

15 be required for proliferation are discussed below. 

Functions for the identified genes were determined by using either Blattner functional class 
designations or by comparing identified sequence with known sequences in various databases. A 
variety of biological functions were noted for the genes to which the clones of the present invention 
correspond. Hie functions for the genes of interest appear in Table II. 

20 The proteins that are listed in Table II are involved in a wide range of biological functions. 
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Predicted fanction 


Unknown 


Unknown | 


Unknown | 


Unknown | 


Unknown 


Unknown 


neaA; 30S ribosomal subunit protein S17 


SOS ribosomal subunit protein L29 | 


SOS ribosomal subunit protein L16 


30S ribosomal subunit protein S3 


eryB; erythromycin sensitivity; SOS 
ribosomal subunit protein L22 


30S ribosomal subunit protein S19 


50S ribosomal subunit protein L2 | 


50S ribosomal subunit protein L23 


eryA; SOS ribosomal subunit protein L4; 
eiythromycin sensitivity 


50S ribosomal subunit protein L3 


nusE; 30S ribosomal subunit protein S10 


Unknown 


Unknown 


Unknown 


pstN; N-acetylglucosamine-specific enzyme 
II of phosphotransferase system 


Unknown 


Unknown j 


Unknown 1 


Unknown 


operon 
prediction 






predicted 








documented 






















predicted 


| predicted 


predicted 


documented 


predicted 








Orientation 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 


A 


A 


A 


V 


V 


V 


V 


Stop 


2189665 


2190242 


3472602 


3472969 


1 3473355 


3474089 


3446205 


3446396 


3446806 


3447520 


3447870 


3448163 


3449001 


3449321 


3449923 


3450563 


3450907 


4091019 


3815166 


2989781 


705113 


150953 


151599 


152231 


152854 


Start 


,2188946 


2189700 


j3472315 | 


3472610 


3472969 


3473355 


3445951 


3446205 


3446396 


3446819 


3447538 


3447885 


3448180 


3449019 


3449318 


3449934 


3450596 


4090705 


3814303 


2989290 


703167 


149715 


151003 


151626 


152243 


SeqID 
(Protein) 


o 
i— • 
cn 


i—* 
t— » 

cn 


CN 

cn 


m 
i—» 
cn 


cn 


VTi 

cn 


VO 

cn 


cn 


OO 

cn 


ON 

cn 
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CN 

cn 


CN 

cn 


CN 
CN 

m 


m 
CN 
cn 


CN 

m 


m 

CN 

cn 


VO 
CN 

m 


CM 

cn 


OO 
CM 

cn 


ON 
CM 
cn 


o 
cn 
cn 


cn 
cn 


CN 
cn 
cn 


cn 
cn 
m 


cn 
cn 


SeqED 
(Gene) 


»— » 


oo 


on 


o 

CN 


CN 


CN 
CN 


m 

CN 




vn 

CN 


vo 

CN 


r- 

CN 


oo 

CN 


Os 
CN 


o 

cn 


i— » 
cn 


CN 
cn 


cn 
m 


cn 


cn 


vo 
cn 


cn 


oo 
cn 


ON 

cn 


O 




Gene 


yehC 










NO 

<>n 


rpsQ 


rpmC 


1 


& 




I 


1 




1 


\rplC 


\rpsJ 


\yiiL 


\yicC 


*n 

JO 
oo 

CNJ 


1 
R 


yadC 


! 


\yadL 




Molecule 
Number" 






EcXA123 1 








EcXA124 






















EcXA125 


EcXA126 


EcXA127a-b 


oo 

CM 

i 


EcXA129 
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Predicted function 


papG, uncC; membrane-bound ATP 
synthase, Fl sector, -subunit (EC 3.6.1.3) 


papB, uncD; membrane-bound ATP 
synthase, Fl sector, -subunit (EC 3.6.1.3) 


papC, uncG; membrane-bound ATP 
synthase, Fl sector, -subunit (EC 3.6.1.3) 


papA, uncA; membrane-bound ATP 
synthase, Fl sector, -subunit (EC 3.6.1.3) 


papE, uncH; membrane-bound ATP 
synthase, Fl sector, -subunit (EC 3.6.1.3) 


papF, uncF; membrane-bound ATP 
synthase, F0 sector, subunit b (EC 3.6.1.3) 


papH, uncE; membrane-bound ATP 
synthase, F0 sector, subunit c; DCCD (EC 
3.6.1.3) 


papD, uncB; membrane-bound ATP 
synthase, F0 sector, subunit a (EC 3.6.1.3) 


unci; membrane-bound ATP synthase 
subunit, Fl-FO-type proton-ATPase (EC 
3.6.1.34) 


Unknown i 


Unknown \ 


regulationCryptic nitrate reductase n, - 
subunit 


regulationCryptic nitrate reductase II, - 
subunit 


regulationCryptic NR II, -subunit | 


regulationCryptic NR II, -subunit | 




operon 
prediction 


documented 


















predicted 




documented 








Same 
operon as 
EcXA118 


Orientation 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 


V 




a* 
$ 


3913600 


3915003 


3915893 


3917485 


3918031 


3918516 


3918817 


3919679 


3920080 


3158185 


3159162 


1534641 


1535333 


1536877 


1540614 




Start 


3913181 


3913621 


3915030 


3915944 


3917498 


3918046 


3918578 


3918864 


3919688 


3156944 


3158185 


1533961 


1534638 


1535333 


1536874 




SeqID 
(Protein) 
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(Gene) 
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Gene 
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atpG 
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atpF 


X 
f 


atpB 


atpl 
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so 
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S| 


c3 
Si 
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Molecule 
Number' 


o 

CO 
1— » 

1 


















EcXA131 




EcXA132 








EcXA133a-c 
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Predicted function 


Stress response protein 


pog; stress response protein 


asuD herC; lysyl tRNA synthetase, 
constitutive 


supK; peptide chain release factor 2 | 


Single-stranded DNA-specific exonuclease, 
5-3 
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Functions for the identified genes were assigned using either Blattner functional class 
designations, functions referenced in Bertyn, MKB "Linkage Map of Escherichia coli K-12, Edition 
10: The Traditional Map 7 *. (1998) Microbiol. Mol. Biol. Rev. September 62 (3): 814-984, or by 
comparing identified sequence with known sequences in various databases. A variety of biological 
5 functions were noted for the genes to which the clones of the present invention correspond. Biological 
functions for genes that lie on the same operon as an identified gene have also been made. The 
functions for the genes of interest appear in Table II. 

The genes of interest have a variety of biological functions. For example, genes that are 
thought to function as transport or binding proteins, that participate in translation or post-translational 
10 modification, that are involved in carbon compound catabolism, that are thought to be enzymes, 
participate in cell processes, energy metabolism and biosynthetic functions appear in Table II. Genes 
that are involved in cell structure, transcription, RNA processing and degradation also appear in Table 

n. 

Several of the expression vectors contain fragments that correspond to genes of unknown 
15 function or if the function is known, it is not known whether the gene is essential. For ©cample, 
EcXAll9, 120, 121, 122a-d, 123, 125, 126, 127a-b, 128, 129, 131, 132, 138, 139a-b, 141, 143, 146, 
147, 14, 149a-b, 152, 153, 154, 155, 156, 158, 159, 160, 162, 163, 164, 165, 166, 167, 168, 170, 171, 
172, 173, 176, 177, 180, 181, 186, 187, 188, 1 89a-b, 190a-b, 191a-b, and 192 are all exogenous nucleic 
acid sequences that correspond to E. coli proteins that have no known function or where the function 
20 has not been shown to be essential or nonessential. 

The present invention reports a number of novel E. coli genes and operons that are required for 
proliferation. From the list of clone sequences identified here, each was identified to be a portion of a 
gene in an operon required for the proliferation of E. coli. Cloned sequences corresponding to genes 
already known to be required for proliferation in K coli include EcXAl 18a-d, 124, 130, 133a-c, 136a- 
25 b, 142, 145, 150, 157, 169, 178, 182, 183 and 185 are exogenous nucleic acid sequences that 
correspond to E coli genes that are known to be required for cellular proliferation. The remaining 
identified sequences correspond to K coli genes previously undesignated as required for proliferation 
in the art. 

An interesting observation of the present invention is that there are also several sequence 
30 fragments that correspond to E coli genes that are not thought to be required for E. coli proliferation. 
Nevertheless, under the conditions described above, the antisense expression of these gene fragments 
causes a reduction in cell growth. This result implies that the genes corresponding to the identified 
sequences are actually required for proliferation or are in operons required for proliferation. Molecule 
Nos. corresponding to these genes are EcXA128, 134, 137, 140a-b, 144a-c, 151, 161, 174, 175, and 
35 184. 

Following identification of the sequences of interest, these sequences were localized into 
operons. Since bacterial genes are expressed in a polycistronic manner, the antisense inhibition of a 
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single gene in an operon might effect the expression of all the other genes on the operon or the genes 
down stream from the single gene identified. In order to determine which of the gene products in an 
operon are required for proliferation, each of the genes contained within an operon may be analyzed for 
their effect on viability as described below. 
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j*ouyu / 


T7r»YA 1 0K 




/1AQ1 A1 Q 

fuyiuiy 


XiC A/\ x ZO 




JoiDXOO 


EcXA127a-b 


2989290 


2989781 


rSCAAlzo 


703167 


7U5113 


rXA/VlZV 


1 >1tV71 C 
14^ /It) 


1 COQC/I 


T7_,'V A 1 Q A 

riCAAliU 


jyiiioi 


1 A9AAOA 


r ft YA lit 


1 1 C/CA/t/1 


jioyioz 


r-YA 1 lO 
JbCAA loZ 




1 j4Uo14 


£XA/\1 jja-c 


Same operon as 

FpYA 1 1 R 






3373Q1A 






3ft3 1 £77 
jvj lO / / 


3037763 








FcXA13R 


19 65474 


3967943 

D£AJ I ZtO 


FcXA139a-h 


9131R1 


Q1419R 


EcXA140a-b 

UvAAitva U 


3R4R49Q 


3R50615 


FcXA141 


9099Q17 


Z. IvUJJ 1 


FcXA142 


Qgmp Anprnn ac 
OoillC t/JJvlUIl do 

EcXA118 




EcXA143 


2985498 


2986190 


EcXA144a-c 


1686600 


1687775 


EcXA145 


2962383 


2964059 


EcXA146 


3904481 


3909153 


EcXA147 


1167423 


1168133 


EcXA148 


3972208 


3976217 


EcXA149a-b 


3054261 


3054809 


EcXA150 


3819733 


3824918 


EcXA151 


3679791 


3681077 


EcXA152 


2276590 


2278503 


EcXAl53 


4248534 


4249862 


EcXA154 


2054880 


2055596 


EcXA155 


2652177 


2652962 


EcXA156 


4059826 


4061091 


EcXAl57 


Same operon as 
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Molecule Number 


Start 


Stop 




EcXA124 




EcXAI58 


2643033 


2645345 




2645346 


2650307 


EcXA159 


506510 


507304 


EcXA160 


1823979 


1830006 


EcXA161 


411831 


416176 


EcXA162 


4490155 


4491974 


EcXA163 


3882705 


3884351 


EcXA164 


2350667 


2354731 


EcXA165 


686062 


686970 


EcXA166 


4039996 


4041625 


EcXAL67 


2169417 


2175230 


EcXA168 


1801118 


1803017 


EcXA169 


Same operon as 
EcXA118 




EcXA170 


381728 


383159 


EcXA171 


915696 


917354 


EcXA172 


2461032 


2462090 


EcXA173 


3623310 


3628232 


EcXA174 


3088366 


3089073 


EcXAl75 


1935673 


1937115 


EcXA176 


4346893 


4349240 


EcXA177 


2493599 


2494585 


EcXA178 


20815 


21078 


EcXA180 


4279362 


4280654 


EcXA181 


4148026 


4150677 


EcXA182 


3077663 


3079654 


EcXA183 


89634 


103153 


EcXA184 


2871410 


2873442 


EcXA185 


3437253 


3440108 


EcXA186 


284619 


287623 


EcXA187 


2383874 


238485 l 


EcXA188 


1168635 


ll 69597 


EcXA189a-c 


1908189 


1909673 


EcXA190a-b 


3192961 


3198606 


EcXA191a-b 


3920685 


3924083 


EcXA192 


3930941 


3935849 



EXAMPLES 

5 Identification of Individu al Genes within an Operon Required for Proliferation 

The following example illustrates a method for determining which gene in an operon is 
required for proliferation. The clone insert corresponding to Molecule No. EcXA119 possesses 
nucleic acid sequence homology to the K coli gene b2883. This gene is located in an operon 
containing the b2882 t b2883, b2884 y and b2885 genes. To determine which gene or genes in this 
10 operon are required for proliferation, each gene is selectively inactivated using homologous 
recombination. Gene b2885 is the first gene to be inactivated. 



-46- 



WO 01/48209 PCT/US00/34419 

Deletion inactivation of a chromosomal copy of a gene in E. coli can be accomplished by 
integrative gene replacement The principle of this method (Hamilton, C. M., et al 1989. J. 
BacterioL 171: 4617-4622) is to construct a mutant allele of the targeted gene, introduce that allele 
into the chromosome using a conditional suicide vector, and then force the removal of the native 
5 wild type allele and vector sequences. This will replace the native gene with a desired mutation(s) 
but leave promoters, operators, etc. intact Essentiality of a gene is determined either by deduction 
from genetic analysis or by conditional expression of a wild type copy of the targeted gene (trans 
complementation). 

The first step is to generate a mutant b2885 allele using PCR amplification. Two sets of 

10 PCR primers are chosen to produce a copy of b2885 with a large central deletion to inactivate the 
gene. In order to eliminate polar effects, it is desirable to construct a mutant allele comprising an 
in-frame deletion of most or all of the coding region of the b2885 gene. Each set of PCR primers is 
chosen such that a region flanking the gene to be amplified is sufficiently long to allow 
recombination (typically at least 500 nucleotides on each side of the deletion). The targeted 

15 deletion or mutation will be contained within this fragment To facilitate cloning of the PCR 
product, the PCR primers may also contain restriction endonuclease sites found in the cloning 
region of a conditional knockout vector such as pK03 (Link, et al 1997 J. BacterioL 179 (20): 
6228-6237). Suitable sites include NoA, SaK, BamHL and Smdl. The b2885 gene fragments are . 
produced using standard PCR conditions including, but not limited to, those outlined in the 

20 manufacturers directions for the Hot Start Taq PCR kit (Qiagen, Inc., Valencia, CA). The PCR 
reactions will produce two fragments that can be fused together. Alternatively, crossover PCR can 
be used to generate a desired deletion in one step (Ho, S. N., et al 1989. Gene TJ: 5 1-59, Horton/R. 
M., et al 1989. Gene 77: 61-68). The mutant allele thus produced is called a "null" allele because it 
cannot produce a functional gene product 

25 The mutant allele obtained from PCR amplification is cloned into the multiple cloning site 

of pK03. Directional cloning of the b2885 null allele is not necessary. The pK03 vector has a 
temperature-sensitive origin of replication derived from pSClOi. Therefore, clones are propagated 
at the permissive temperature of 30°C. The vector also contains two selectable marker genes: one 
that confers resistance to chloramphenicol and another, the Bacillus snbtilis sacB gene, that allows 

30 for counter-selection on sucrose containing growth medium. Clones that contain vector DNA with 
the null allele inserted are confirmed by restriction endonuclease analysis and DNA sequence 
analysis of isolated plasmid DNA. The plasmid containing the b288 5 null allele insert is known as 
a knockout plasmid. 

Once the knockout plasmid has been constructed and its sequence verified, it is transformed 
35 into a Rec 4 * E. coli host cell. Transformation can be by any standard method such as 
electroporation. In some fraction of the transformed cells, plasmids will integrate into the E coli 
chromosome by homologous recombination between the b2885 null allele in the plasmid and the 
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b2885 gene in the chromosome. Transformant colonies in which such an event has occurred are 
readily selected by growth at the non-permissive temperature of 43°C and in the presence of 
choramphenicol. At this temperature, the plasmid will not replicate as an episome and will be lost 
from cells as they grow and divide. These cells are no longer resistant to chloramphenicol and will 
5 not grow when it is present However, cells in which die knockout plasmid has integrated into the 
E. coli chromosome remain resistant to chloramphenicol and propagate. 

Cells containing integrated knock-out plasmids are usually die result of a single crossover 
event that creates a tandem repeat of the mutant and native wild type alleles of b2885 separated by 
the vector sequences. A consequence of this is that B2885 will still be expressed in these cells. In 

1 0 order to determine if the gene is essential for growth, the wild type copy must be removed. This is 
accomplished by selecting for plasmid excision, a process in which homologous recombination 
between the two alleles results in looping out of the plasmid sequences. Cells that have undergone 
such an excision event and have lost plasmid sequences including sacB gene are selected for by 
addition of sucrose to the medium. The sacB gene product converts sucrose to a toxic molecule. 

15 Thus counter selection with sucrose ensures that plasmid sequences are no longer present in the 
cell. Loss of plasmid sequences is further confirmed by testing for sensitivity to chloramphenicol 
(loss of the chloramphenicol resistance gene). The latter test is important because occasionally a 
mutation in the sacB gene can occur resulting in a loss of sacB function with no effect on plasmid 
replication (Link, et. al., 1997 J. Bacteriol 179 (20): 6228-6237). These artifact clones retain 

20 plasmid sequences and are therefore still resistant to chloramphenicol. 

In the process of plasmid excision, one of the two b2885 alleles is lost from the 
chromosome along with the plasmid DNA. In general, it is equally likely that the null allele or the 
wild type allele will be lost. Therefore, if the b2885 gene is not essential, half of the clones 
obtained in this experiment will have the wild type allele on the chromosome and half will have the 

25 null allele. However, if the b2885 gene is essential, cells containing the null allele will not be 
obtained as a single copy of the null allele would be lethal. 

To determine the essentiality of b2885, a statistically significant number of the resulting 
clones, at least 20, are analyzed by PCR amplification of the b2885 gene. Since the null allele is 
missing a significant portion of the b2885 gene, its PCR product is significantly shorter than that of 

30 the wild type gene and the two are readily distinguished by gel electrophoretic analysis. The PCR 
products may also be subjected to sequence determination for further confirmation by methods well 
known to those in the art. 

The above experiment is generally adequate for determining the essentiality of a gene such 
as b2885. However, it may be necessary or desirable to more directly confirm the essentiality of 

35 the gene. There are several methods by which this can be accomplished. In general, these involve 
three steps: 1) construction of an episome containing a wild type allele, 2) isolation of clones 
containing a single chromosomal copy of the mutant null allele as described above but in the 
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presence of the episomal wild type allele, and then 3) determining if the cells survive when the 
expression of the episomal allele is shut off. In this case, the trans copy of wild type b2885 is made 
by PCR cloning of the entire coding region of B2885 and inserting it in the sense orientation 
downstream of an inducible promoter such as the K coli lac promoter. Transcription of this allele 
5 of b2885 will be induced in the presence of IPTG which inactivates the lac repressor. Under IPTG 
induction b2885 protein will be expressed as long as the recombinant gene also possesses a 
ribosomal binding site, also known as a "Shine-Dalgarno Sequence". The trans copy of b2885 is 
cloned on a plasmid that is compatible with pSClOl. Compatible vectors include plSA, pBR322, 
and die pUC plasmids, among others. Replication of the compatible plasmid will not be 

10 temperature-sensitive. The entire process of integrating the null allele of b2885 and subsequent 
plasmid excision is carried out in the presence of IPTG to ensure the expression of functional b2885 
protein is maintained throughout After the null b2885 allele is confirmed as integrated on the 
chromosome in place of the wild type b2885 allele, then IPTG is withdrawn and expression of 
functional b2885 protein shut off. If the b2885 gene is essential, cells will cease to proliferate 

15 under these conditions. However, if the b2885 gene is not essential, cells will continue to 
proliferate under these conditions. In this experiment, essentiality is determined by conditional 
expression of a wild type copy of the gene rather than inability to obtain the intended chromosomal 
disruption. 

An advantage of this method over some other gene disruption techniques is that the targeted 
20 gene can be deleted or mutated without the introduction of large segments of foreign DNA. 
Therefore, polar effects on downstream genes are eliminated or minimized. There are methods 
described to introduce inducible promoters upstream of potential essential bacterial genes. 
However in such cases, polarity from multiple transcription start points can be a problem. One way 
of preventing this is to insert a gene disruption cassette that contains strong transcriptional 
25 terminators upstream of the integrated inducible promoter (Zhang, Y, and Cronan, J. E. 1996 J. 
Bacteriol 178 (12): 3614-3620). The described techniques will all be familiar to one of ordinary 
skill in the art 

Following the analysis of the b2885 gene, the other genes of the operon are investigated to 
determine if they are required for proliferation. 
30 EXAMPLE 6 

Expression of the Proteins Encoded by Genes Identified as Required for E. coli Proliferation 
The following is provided as one exemplary method to express the proliferation-required 
proteins encoded by the identified sequences described above. First, the initiation and termination 
codons for the gene are identified. If desired, methods for improving translation or expression of the 
35 protein are well known in the art For example, if the nucleic acid encoding the polypeptide to be 
expressed lacks a methionine codon to serve as the initiation site, a strong Shine-Delgarno sequence, or 
a stop codon, these sequences can be added. Similarly, if the identified nucleic acid sequence lacks a 
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transcription termination signal, this sequence can be added to the construct by, for example, splicing 
out such a sequence from an appropriate donor sequence. In addition, die coding sequence may be 
operably linked to a strong promoter or an inducible promoter if desired. The identified nucleic acid 
sequence or portion thereof encoding the polypeptide to be expressed is obtained by PCR from the 
5 bacterial expression vector or genome using oligonucleotide primers complementary to the identified 
nucleic acid sequence or portion thereof and containing restriction endonuclease sequences for Ncol 
incorporated into the 5 9 primer and BgHl at the 5' end of the corresponding 3 '-primer, taking care to 
ensure that the identified nucleic acid sequence is positioned in frame with the termination signal. Hie 
purified fragment obtained from the resulting PCR reaction is digested with Ncol and 2?g/H, purified 

10 and ligated to an expression vector. 

The ligated product is transformed into DH5a or some other R coli strain suitable for the over 
expression of potential proteins. Transformation protocols are well known in the art For example, 
transformation protocols are described in: Current Protocols in Molecular Biology, Vol. 1, Unit 1 .8, 
(Ausubel, et al., Eds.) John Wiley & Sons, Inc. (1997). Positive transformants are selected after 

15 growing the transformed cells on plates containing 50-100 pg/ml Ampicillin (Sigma, St Louis, 
Missouri). In one embodiment, the expressed protein is held in the cytoplasm of the host organism. In 
an alternate embodiment, the expressed protein is released into the culture medium. In still another 
alternative, the expressed protein can be sequestered in the periplasmic space and liberated therefrom 
using any one of a number of cell lysis techniques known in the art. For example, the osmotic shock 

20 cell lysis method described in Chapter 16 of Current Protocols in Molecular Biology, Vol. 2,. 
(Ausubel, et al., Eds.) John Wiley & Sons, Inc. (1997). Each of these procedures can be used to 
express a proliferation-required protein. 

Expressed proteins, whether in the culture medium or liberated from the periplasmic space or 
the cytoplasm, are then purified or enriched from the supernatant using conventional techniques such as 

25 ammonium sulfate precipitation, standard chromatography, immunoprecipitatton, 
immunochromatography, size exclusion chromatography, ion exchange chromatography, and HPLC. 
Alternatively, the secreted protein can be in a sufficiently enriched or pure state in the supernatant or 
growth media of the host to permit it to be used for its intended purpose without further enrichment 
The purity of the protein product obtained can be assessed using techniques such as Coomassie or 

3 0 silver staining or using antibodies against the control protein. Coomassie and silver staining techniques 
are familiar to those skilled in the art 

Antibodies capable of specifically recognizing the protein of interest can be generated using 
synthetic peptides using methods well known in the art See, Antibodies: A Laboratory Manual, 
(Harlow and Lane, Eds.) Cold Spring Harbor Laboratory (1988). For example, 15-mer peptides having 

35 a sequence encoded'by the appropriate identified gene sequence of interest or portion thereof can be 
chemically synthesized. The synthetic peptides are injected into mice to generate antibodies to the 
polypeptide encoded by the identified nucleic acid sequence of interest or portion thereof. 
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Alternatively, samples of the protein expressed from the expression vectors discussed above can be 
purified and subjected to amino acid sequencing analysis to confirm the identity of the recombinantry 
expressed protein and subsequently used to raise antibodies. An Example describing in detail the 
generation of monoclonal and polyclonal antibodies appears in Example 7. 
5 The protein encoded by the identified nucleic acid sequence of interest or portion thereof can 

be purified using standard immunochromatography techniques. In such procedures, a solution 
containing the secreted protein, such as the culture medium or a cell extract, is applied to a column 
having antibodies against the secreted protein attached to the chromatography matrix. Hie secreted 
protein is allowed to bind the immunochromatography column. Thereafter, the column is washed to 

10 remove non- specifically bound proteins. The specifically bound secreted protein is then released from 
the column and recovered using standard techniques. These procedures are well known in the art. 

In an alternative protein purification scheme, the identified nucleic acid sequence of interest or 
portion thereof can be incorporated into expression vectors designed for use in purification schemes 
employing chimeric polypeptides. In such strategies the coding sequence of the identified nucleic acid 

1 5 sequence of interest or portion thereof is inserted in-frame with the gene encoding the other half of the 
chimera. The other half of the chimera can be maltose binding protein (MB?) or a nickel binding 
polypeptide encoding sequence. A chromatography matrix having antibody to MBP or nickel attached 
thereto is then used to purify the chimeric protein. Protease cleavage sites can be engineered between 
the MBP gene or the nickel binding polypeptide and the identified expected gene of interest, or portion 

20 thereof. Thus, the two polypeptides of the chimera can be separated from one another by protease 
digestion. 

One useful expression vector for generating maltose binding protein fusion proteins is pMAL 
(New England Biolabs), which encodes the malE gene. In the pMal protein fusion system, the cloned 
gene is inserted into a pMal vector downstream from the malE gene. This results in the expression of 
25 an MBP-fusion protein. The fusion protein is purified by affinity chromatography. These techniques 
as described are well known to those skilled in the art of molecular biology. 

EXAMPLE7 
Production of an Antibody to an isolated E coli Protein 
Substantially pure protein or polypeptide is isolated from the transformed cells as described in Example 
30 6. The concentration of protein in the final preparation is adjusted, for example, by concentration on a 
10,000 molecular weight cut off AMICON filter device (Millipore, Bedford, MA), to the level of a few 
micrograms/ml. Monoclonal or polyclonal antibody to the protein can then be prepared as follows: 
Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of any of the peptides identified and isolated as described can 
35 be prepared from murine hybridomas according to the classical method of Kohler, G. and Milstein, C, 
Nature 256:495 (1975) or any of the well-known derivative methods thereof. Briefly, a mouse is 
repetitively inoculated with a few micrograms of the selected protein or peptides derived therefrom 



-51- 



WO 01/48209 PCT/USO0/34419 

over a period of a few weeks. The mouse is then sacrificed, and the antibody producing cells of the 
spleen isolated Hie spleen cells are fused by means of polyethylene glycol with mouse myeloma cells, 
and the excess unfused cells destroyed by growth of the system on selective media comprising 
aminopterin (HAT media). The successfully fused cells are diluted and aliquots of the dilution placed 
5 in wells of a microtiter plate where growth of the culture is continued. Antibody-producing clones are 
identified by detection of antibody in the supernatant fluid of the wells by immunoassay procedures, 
such as ELISA, as described by Engvall, E., "Enzyme immunoassay ELISA and EMIT," Meth. 
EnzymoL 70:419 (1980), and derivative methods thereof. Selected positive clones can be expanded 
and their monoclonal antibody product harvested for use. Detailed procedures for monoclonal antibody 
10 production are described in Davis, L. et al. Basic Methods in Molecular Biology Elsevier, New York. 
Section 21-2. 

Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single protein or a 
peptide can be prepared by immunizing suitable animals with the expressed protein or peptides derived 
\ 15 therefrom described above, which can be unmodified or modified to enhance immunogenicity. 
Effective polyclonal antibody production is affected by many factors related both to the antigen and the 
host species. For example, small molecules tend to be less immunogenic than larger molecules and can 
require the use of carriers and adjuvant. Also, host animals vary in response to site of inoculations and 
dose, with both inadequate or excessive doses of antigen resulting in low titer antisera. Small doses (ng 
20 level) of antigen administered at multiple intradermal sites appears to be most reliable. An effective 
immunization protocol for rabbits can be found in Vaitukaitis, J. et al. J. Clin. Endocrinol. Metab. 
33:98^-991(1971). 

Booster injections can be given at regular intervals, and antiserum harvested when antibody 
titer thereof, as determined semi-quantitatively, for example, by double immunodiffusion in agar 
25 against known concentrations of the antigen, begins to fall. See, for example, Ouchterlony, O. et al., 
Chap. 19 in: Handbook of Experimental Immunology D. Wier (ed) Blackwell (1973). Plateau 
concentration of antibody is usually in the range of 0.1 to 0 2 mg/ml of serum (about 12jiM). Affinity 
of the antisera for the antigen is determined by preparing competitive binding curves, as described, for 
example, by Fisher, D., Chap. 42 in: Manual of Clinical Immunology, 2d Ed. (Rose and Friedman, 
30 Eds.) Amer. Soc. For Microbiol., Washington, D.C. (1980). 

Antibody preparations prepared according to either protocol are useful in quantitative 
immunoassays which determine concentrations of antigen-bearing substances in biological samples; 
they are also used semi-quantitatively or qualitatively to identify the presence of antigen in a biological 
sample. The antibodies can also be used in therapeutic compositions for killing bacterial cells 
. 35 expressing the protein. 
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EXAMPLE 8 

Screening Chemical Libraries 

A. Protein-Based Assays 

Having isolated and expressed bacterial proteins shown to be required for bacterial 
5 proliferation, the present invention further contemplates the use of these expressed proteins in assays to 
screen libraries of compounds for potential drug candidates. The generation of chemical libraries is 
well known in the art For example combinatorial chemistry can be used to generate a library of 
compounds to be screened in the assays described herein. A combinatorial chemical library is a 
collection of diverse chemical compounds generated by either chemical synthesis or biological 

10 synthesis by combining a number of chemical "building blocks" reagents. For example, a linear 
combinatorial chemical library such as a polypeptide library is formed by combining amino acids in 
every possible combination to yield peptides of a given length. Millions of chemical compounds 
theoretically can be synthesized through such combinatorial use of chemical building blocks. For 
example, one commentator observed that the systematic, combinatorial mixing of 1 00 interchangeable 

1 5 chemical building blocks results in the theoretical synthesis of 1 00 million tetrameric compounds or 1 0 
billion pentameric compounds. (Gallop et al., "Applications of Combinatorial Technologies to Drug 
Discovery, Background and Peptide Combinatorial Libraries," Journal of Medicinal Chemistiy, Vol. 
37, No. 9, 1233-1250 (1994). Other chemical libraries known to those in the art may also be used, 
including natural product libraries. 

20 Once generated, combinatorial libraries can be screened for compounds that possess desirable 

biological properties. For example, compounds which may be useful as drugs or to develop drugs 
would likely have the ability to bind to the target protein identified, expressed and purified as discussed 
above. Further, if the identified target protein is an enzyme, candidate compounds would likely 
interfere with the enzymatic properties of the target protein. Any enzyme can be a target protein. For 

25 example, the enzymatic function of a target protein can be to serve as a protease, nuclease, 
phosphatase, dehydrogenase, transporter protein, transcriptional enzyme, and any other type of 
enzyme known or unknown. Thus, the present invention contemplates using the protein products 
described above to screen combinatorial and other chemical libraries. 

Those in the art will appreciate thai a number of techniques exist for characterizing target 

30 proteins in order to identify molecules useful for the discovery and development of therapeutics. For 
example, some techniques involve the generation and use of small peptides to probe and analyze target 
proteins both biochemically and genetically in order to identify and develop drug leads. Such 
techniques include the methods described in PCT publications No. W09935494, W09819162, 
W09954728. 

35 In another example, the target protein is a serine protease and the substrate of the enzyme is 

known. The present example is directed towards the analysis of libraries of compounds to identify 
compounds that function as inhibitors of the target enzyme. First, a library of small molecules is 
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generated using methods of combinatorial library formation well known in the ait U.S. Patent NOs. 
5,463,564 and 5,574, 656, to Agrafiotis, et al., entitled "System and Method of Automatically 
Generating Chemical Compound with Desired Properties," are two such teachings. Then the library 
compounds are screened to identify library compounds that possess desired structural and functional 
5 properties. U.S. Patent No. 5,684,7 1 1 , also discusses a method for screening libraries. 

To illustrate the screening process, die combined target and chemical compounds of the library 
are exposed to and permitted to interact with the purified enzyme. A labeled substrate is added to the 
incubation. The label on the substrate is such that a detectable signal is emitted from metabolized 
substrate molecules. The emission of this signal permits one to measure the effect of the combinatorial 
10 library compounds on the enzymatic activity of target enzymes. The characteristics of each library 
compound is encoded so that compounds demonstrating activity against the enzyme can be analyzed 
and features common to the various compounds identified can be isolated and combined into future 
iterations of libraries. 

Once a library of compounds is screened, subsequent libraries are generated using those 
15 chemical building blocks that possess the features shown in the first round of screen to have activity 
against the target enzyme. Using this method, subsequent iterations of candidate compounds will 
possess more and more of those structural and functional features required to inhibit the function of the 
target enzyme, until a group of enzyme inhibitors with high specificity for the enzyme can be found. 
These compounds can thai be further tested for their safety and efficacy as antibiotics for use in 
20 mammals. 

It will be readily appreciated that this particular screening methodology is exemplary only. 
Other methods are well known to those skilled in the art For example, a wide variety of screening 
techniques are known for a large number of naturally-occurring targets when the biochemical 
function of the target protein is known. 

25 B. Cell-based Assays 

Current cell-based assays used to identify or to characterize compounds for drug discovery 
and development frequently depend on detecting the ability of a test compound to inhibit the 
activity of a target molecule located within a cell or located on the surface of a cell. An advantage 
of cell-based assays is that they allow the effect of a compound on a target molecule's activity to be 

30 detected within the physiologically relevant environment of the cell as opposed to an in vitro 
environment Most often such target molecules are proteins such as enzymes, receptors and the 
like. However, target molecules may also include other molecules such as DNAs, lipids, 
carbohydrates and RNAs including messenger KNAs, ribosomal RNAs, tRNAs and the like. A 
number of highly sensitive cell-based assay methods are available to those of skill in the art to 

35 detect binding and interaction of test compounds with specific target molecules. However, these 
methods are generally not highly effective when the test compound binds to or otherwise interacts 
with its target molecule with moderate or low affinity. In addition, the target molecule may not be 
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readily accessible to a test compound in solution, such as when the target molecule is located inside 
the cell or within a cellular compartment such as the periplasm of a bacterial cell. Thus, current 
cell-based assay methods are limited in that they are not effective in identifying or characterizing 
compounds that interact with their targets with moderate to low affinity or compounds that interact 
5 with targets that are not readily accessible. 

Cell-based assay methods of the present invention have substantial advantages over current 
cell-based assays practiced in the art These advantages derive from the use of sensitized cells in 
which the level or activity of a proliferation-required gene product (the target molecule) has been 
specifically reduced to the point where the presence or absence of its function becomes a rate- 
10 determining step for cellular proliferation. Bacterial, fungal, plant, or animal cells can all be used 
with the present method. Such sensitized cells become much more sensitive to compounds that are 
active against the affected target molecule. Thus, cell-based assays of the present invention are 
capable of detecting compounds exhibiting low or moderate potency against the target molecule of 
interest because such compounds are substantially more potent on sensitized cells than on non- 
15 sensitized cells. The affect may be such that a test compound may be two to several times more 
potent, at least 10 times more potent, at least 20 times more potent, at least 50 times more potent, at 
least 100 times more potent, at least 1000 times more potent, or even more than 1000 times more 
potent when tested on the sensitized cells as compared to the non-sensitized cells. 

Due in part to the increased appearance of antibiotic resistance in pathogenic . 
20 microorganisms and to the significant side-effects associated with some currently used antibiotics, 
novel antibiotics acting at new targets are highly sought after in the art. Yet, another limitation in 
the current art related to cell-based assays is the problem of identifying hits against the same kinds 
of target molecules in the same limited set of biological pathways over and over again. This may 
occur when compounds acting at such new targets are discarded, ignored or fail to be detected 
25 because compounds acting at the "old" targets are encountered more frequently and are more potent 
than compounds acting at the new targets. As a result, the majority of antibiotics in use currently 
interact with a relatively small number of target molecules within an even more limited set of 
biological pathways. 

The use of sensitized cells of the current invention provides a solution to the above problem 
30 in two ways. First, desired compounds acting at a target of interest, whether a new target or a 
previously known but poorly exploited target, can now be detected above the "noise" of compounds 
acting at the "old" targets due to the specific and substantial increase in potency of such desired 
compounds when tested on the sensitized cells of the current invention. Second, the methods used 
to sensitize cells to compounds acting at a target of interest may also sensitize these cells to 
35 compounds acting at other target molecules within the same biological pathway. For example, 
expression of an antisense molecule to a gene encoding a ribosomal protein is expected to sensitize 
the cell to compounds acting at that ribosomal protein and may also sensitize the cells to 
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compounds acting at any of the ribosomal components (proteins or rRNA) or even to compounds 
acting at any target which is part of the protein synthesis pathway. Thus an important advantage of 
the present invention is the ability to reveal new targets and pathways that were previously not 
readily accessible to drug discovery methods. 
5 Sensitized cells of the present invention are prepared by reducing die activity or level of a 

target molecule. The target molecule may be a gene product, such as an RNA or polypeptide 
produced from the proliferation-required nucleic acids described herein. Alternatively, the target 
may be a gene product such as an RNA or polypeptide which is produced from a sequence within 
the same operon as the proliferation-required nucleic acids described herein. In addition, the target 

1 0 may be an RNA or polypeptide in the same biological pathway as the proliferation-required nucleic 
acids described herein. Such biological pathways include, but are not limited to, enzymatic, 
biochemical and metabolic pathways as well as pathways involved in the production of cellular 
structures such the cell wall. 

Current methods employed in the arts of medicinal and combinatorial chemistries are able 

15 to make use of structure-activity relationship information derived from testing compounds in 
various biological assays including direct binding assays and cell-based assays. Occasionally 
compounds are directly identified in such assays that are sufficiently potent to be developed as 
drugs. More often, initial hit compounds exhibit moderate or low potency. Once a hit compound is 
identified with low or moderate potency, directed libraries of compounds are synthesized and tested 

20 in order to identify more potent leads. Generally these directed libraries are combinatorial chemical 
libraries consisting of compounds with structures related to the hit compound but containing 
systematic variations including additions, subtractions and substitutions of various structural 
features. When tested for activity against the target molecule, structural features are identified that 
either alone or in combination with other features enhance or reduce activity. This information is 

25 used to design subsequent directed libraries containing compounds with enhanced activity against 
the target molecule. After one or several iterations of this process, compounds with substantially 
increased activity against the target molecule are identified and may be further developed as drugs. 
This process is facilitated by use of the sensitized cells of the present invention since compounds 
acting at the selected targets exhibit increased potency in such cell-based assays, thus; more 

30 compounds can now be characterized providing more useful information than would be obtained 
otherwise. 

Thus, it is now possible using cell-based assays of the present invention to identify or 
characterize compounds that previously would not have been readily identified or characterized 
including compounds that act at targets that previously were not readily exploited using cell-based 
35 assays. The process of evolving potent drug leads from initial hit compounds is also substantially 
improved by the cell-based assays of the present invention because, for the same number of test 
compounds, more structure-function relationship information is likely to be revealed. 
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The method of sensitizing a cell entails selecting a suitable gene or operon. A suitable gene 
or operon is one whose expression is required for the proliferation of the cell to be sensitized. The 
next step is to introduce into the cells to be sensitized, an antisense RNA capable of hybridizing to 
the suitable gene or operon or to the RNA encoded by the suitable gene or operon. Introduction of 

5 the antisense RNA can be in the form of an expression vector in which antisense RNA is produced 
under the control of an inducible promoter. The amount of antisense RNA produced is limited by 
varying the inducer concentration to which the cell is exposed and thereby varying the activity of 
the promoter driving transcription of the antisense RNA. Thus, cells are sensitized by exposing 
them to an inducer concentration that results in a sub-lethal level of antisense RNA expression. 

10 In one embodiment of the cell-based assays, the identified exogenous £ coli nucleotide 

sequences of the present invention are used to inhibit the production of a proliferation-required 
protein. Expression vectors producing antisense RNA complementary to identified genes required 
for proliferation are used to limit the concentration of a proliferation-required protein without 
severely inhibiting growth. To achieve that goal, a growth inhibition dose curve of inducer is 

15 calculated by plotting various doses of inducer against the corresponding growth inhibition caused 
by the antisense expression. From this curve, various percentages of antisense induced growth 
inhibition, from 1 to 100% can be determined. If the promoter contained in the expression vector 
contains a lac operator the transcription is regulated by lac repressor and expression from the 
promoer is inducible with EPTG. For example, the highest concentration of the inducer EPTG that 

20 does not reduce the growth rate significantly (0% growth inhibition) can be predicted from the 
curve. Cellular proliferation can be monitored by growth medium turbidity via OD measurements. 
In another example, the concentration of inducer that reduces growth by 25% can be predicted from 
the curve. In still another example, a concentration of inducer that reduces growth by 50% can be 
calculated. Additional parameters such as colony forming units (cfu) can be used to measure 

25 cellular viability. 

Cells to be assayed are exposed to the above-determined concentrations of inducer. The 
presence of the inducer at this sub-lethal concentration reduces the amount of the proliferation 
required gene product to a low amount in the cell that will limit but not prevent growth. Cells 
grown in the presence of this concentration of inducer are therefore specifically more sensitive to 

30 inhibitors of the proliferationrrequired protein or RNA of interest or to inhibitors of proteins or 
RNAs in the same biological pathway as the proliferation-required protein or RNA of interest but 
not to inhibitors of unrelated proteins or RNAs. 

Cells pretreated with sub-inhibitory concentrations of inducer and thus containing a reduced 
amount of proliferation-required target gene product are then used to screen for compounds that 

35 reduce cell growth. He sub-lethal concentration of inducer may be any concentration consistent 
with the intended use of the assay to identify candidate compounds to which the cells are more 
sensitive. For example, the sub-lethal concentration of the inducer may be such that growth 
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inhibition is at least about 5%, at least about 8%, at least about 10%, at least about 20%, at least 
about 30%, at least about 40%, at least about 50%, at least about 60% at least about 75%, 90%, 
95% or more. Cells which are pre-sensitized using the preceding method are more sensitive to 
inhibitors of the target protein because these cells contain less target protein to be inhibited than do 
5 wild-type cells. 

In another embodiment of the cell-based assays of the present invention, the level or 
activity of a proliferation required gene product is reduced using a mutation, such as a temperature 
sensitive mutation, in the proliferation-required sequence and an antisense nucleic acid 
complementary to the proliferation-required sequence. Growing the cells at an intermediate 

10 temperature between the permissive and restrictive temperatures of the temperature sensitive 
mutant where the mutation is in a proliferation-required gene produces cells with reduced activity 
of the proliferation-required gene product. The antisense RNA complementary to the proliferation- 
required sequence further reduces the activity of the proliferation-required gene product. Drugs that 
may not have been found using either the temperature sensitive mutation or the antisense nucleic 

15 acid alone may be identified by determining whether cells in which expression of the antisense 
nucleic acid has been induced and which are grown at a temperature between the permissive 
temperature and the restrictive temperature are substantially more sensitive to a test compound than 
cells in which expression of the antisense nucleic acid has not been induced and which are grown at 
a permissive temperature. Also drugs found previously from either the antisense nucleic acid alone 

20 or the temperature sensitive mutation alone may have a different sensitivity profile when used in 
cells combining the two approaches, and that sensitivity profile may indicate a more specific action 
of the drug in inhibiting one or more activities of the gene product. 

Temperature sensitive mutations may be located at different sites within the gene and 
correspond to different domains of the protein. For example, the dnaB gene of Escherichia coli 

25 encodes the replication fork DNA helicase. DnaB has several domains, including domains for 
oligomerization, ATP hydrolysis, DNA binding, interaction with primase, interaction with DnaC, 
and interaction with DnaA [(Biswas, E.E. and Biswas, S.B. 1999 Mechanism and DnaB helicase of 
Escherichia coli: structural domains involved in ATP hydrolysis, DNA binding, and 
oligomemation. Biochem. 38:10919-10928; Hiasa, H. and Marians, KJ. 1999 Initiation of 

30 bidirectional replication at the chromosomal origin is directed by the interaction between helicase 
and primase. J. Biol. Chem. 274:27244-27248; San Martin, C, Radennacher, M., Wolpensinger, 
B., Engel, A., Miles, C.S., Dixon, N.E., and Carazo, J.M. 1998 Three-dimensional reconstructions 
from cryoelectron microscopy images reveal an intimate complex between helicase DnaB and its 
loading partner DnaC. Structure 6:501-9; Sutton, M.D., Carr, K.M., Vicente, M., and Kaguni, JM. 

35 1998 Escherichia coli DnaA protein. The N-terminal domain and loading of DnaB helicase at the R 
coli chromosomal. J. Biol. Chem. 273:34255-62)]. Temperature sensitive mutations in different 
domains of DnaB confer different phenotypes at the restrictive temperature, which include either an 
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abrupt stop or slow stop in DNA replication with or without DNA breakdown (Wechsler, JJL and 
Gross, J.D. 1971. Escherichia coli mutants temperature-sensitive for DNA synthesis. Mol. Gen. 
Genetics 113:273-284) and termination of growth or cell death. Combining the use of temperature 
sensitive mutations in the dnaB gene that cause cell death at the restrictive temperature with an 
S antisense to the dnaB gene could lead to the discovery of very specific and effective inhibitors of 
one or a subset of activities exhibited by DnaB. 

It will be appreciated that the above method may be performed with any mutation which 
reduces but does not eliminate the activity or level of the gene product which is required for 
proliferation. 

10 When screening for antimicrobial agents against a gene product required for proliferation, 

growth inhibition of cells containing a limiting amount of that proliferation-required gene product 
can be assayed. Growth inhibition can be measured by directly comparing the amount of growth, 
measured by the optical density of the growth medium, between an experimental sample and a 
control sample. Alternative methods for assaying cell proliferation include measuring green 

1 5 fluorescent protein (GFP) reporter construct emissions, various enzymatic activity assays, and other 
methods well known in the art 

It will be appreciated that the above method may be performed in solid phase, liquid phase 
or a combination of the two. For example, cells grown on nutrient agar containing the inducer of the 
antisense construct may be exposed to compounds spotted onto the agar surface. A compound's 

20 effect may be judged from the diameter of the resulting killing zone, the area around the compound 
application point in which cells do not grow. Multiple compounds may be transferred to agar plates 
and simultaneously tested using automated and semi-automated equipment including but not 
restricted to multi-channel pipettes (for example the Bee km an Multimek) and multi-channel 
spotters (for example the Genomic Solutions Flexys). In this way multiple plates and thousands to 

25 millions of compounds may be tested per day. 

The compounds may also be tested entirely in liquid phase using microliter plates as 
described below. Liquid phase screening may be performed in microtiter plates containing 96, 384, 
1536 or more wells per microtiter plate to screen multiple plates and thousands to millions of 
compounds per day. Automated and semi-automated equipment may be used for addition of 

30 reagents (for example cells and compounds) and determination of cell density. 

EXAMPLE 9 

Cell-based Assay Using Antisense Complementary to Genes Encoding Ribosomal Proteins 
Hie effectiveness of the above cell-based assay was validated using constructs expressing 
antisense RNA to the proliferation required E. coli genes rplL, rpU, and rplW encoding ribosomal 
35 proteins L7/L12, L10 and L23 respectively. These proteins are part of the protein synthesis 
apparatus of the cell and as such are required for proliferation. These constructs were used to test 
the effect of antisense expression on cell sensitivity to antibiotics known to bind to the ribosome 
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and thereby inhibit protein synthesis. Constructs expressing antisense RNA to several other genes 
(elaD, visC, yohH, and atpE/B\ the products of which are not involved in protein synthesis were 
used for comparison. 

First, pLEX5BA (Krause et ah, J. Mol. BioL 274: 365 (1997)) expression vectors 
5 containing antisense constructs to either rplWor to elaD were introduced into separate E. coli cell 
populations. Vector introduction is a technique well known to those of ordinary skill in the art. 
The expression vectors of this example contain IPTG inducible promoters that drive the expression 
of the antisense RNA in the presence of the inducer. However, those skilled in the art will 
appreciate that other inducible promoters may also be used. Suitable expression vectors are also 

10 well known in the art. The E. coli antisense clones to genes encoding ribosomal proteins L7/L12, 
L10 and L23 were used to test the effect of antisense expression on cell sensitivity to the antibiotics 
known to bind to these proteins. Expression vectors containing antisense to either the genes 
encoding L7/L12 and L10 or L23 were introduced into separate E coli cell populations. 

The cell populations were exposed to a range of IPTG concentrations in liquid medium to 

15 obtain the growth inhibitory dose curve for each clone (Fig. 1). First, seed cultures were grown to a 
particular turbidity that is measured by the optical density (OD) of the growth solution. The OD of 
the solution is directly related to the number of bacterial cells contained therein. Subsequently, 
sixteen 200 ul liquid medium cultures were grown in a 96 well microtiter plate at 37° C with a 
range of IPTG concentrations in duplicate two-fold serial dilutions from 1600 uM to 12.5 uM (final 

20 concentration). Additionally, control cells were grown in duplicate without IPTG. These cultures 
were started from equal amounts of cells derived from the same initial seed culture of a clone of 
interest The cells were grown for up to 15 hours and the extent of growth was determined by 
measuring the optical density of the cultures at 600 nm. When the control culture reached mid-log 
phase the percent growth (relative to the control culture) for each of the IPTG containing cultures 

25 was plotted against the log concentrations of EPTG to produce a growth inhibitory dose response 
curve for the IPTG. The concentration of IPTG that inhibits cell growth to 50% (IC 5 o) as compared 
to the 0 mM IPTG control (0% growth inhibition) was then calculated from the curve. Under these 
conditions, an amount of antisense RNA was produced that reduced the expression levels of rplW 
and elaD to a degree such that growth was inhibited by 50%. 

30 Alternative methods of measuring growth are also contemplated. Examples of these 

methods include measurements of proteins, the expression of which is engineered into the cells 
being tested and can readily be measured. Examples of such proteins include green fluorescent 
protein (GFP) and various enzymes. 

Cells were pretreated with the selected concentration of IPTG and then used to test the 

35 sensitivity of cell populations to tetracycline, erythromycin and other protein synthesis inhibitors. 
Figure 2 is an IPTG dose response curve in E coli transformed with an BPTG-inducible plasmid 
containing either an antisense clone to the E coli rplW gene (AS-rplW) which encodes ribosomal 
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protein L23 which is required for protein synthesis and essential for cell proliferation, or an 
antisense clone to the elaD (AS-elaD) gene which is not known to be involved in protein synthesis 
and which is also essential for proliferation. 

An example of a tetracycline dose response curve is shown in Figures 2A and 2B for the 
5 rplW and elaD genes, respectively. Cells were grown to log phase and then diluted into media 
alone or media containing EPTG at concentrations which give 20% and 50% growth inhibition as 
determined by IPTG dose response curves. After 2.5 hours, the cells were diluted to a final ODeoo 
of 0.002 into 96 well plates containing (1) +/- IPTG at the same concentrations used for the 2.5 hour 
pre-incubation; and (2) serial two-fold dilutions of tetracycline such that the final concentrations of 

10 tetracycline range from 1 jig/ml to 15.6 ng/ml and 0 iig/ml. The 96 well plates were incubated at 
37°C and the OD^oo was read by a plate reader every 5 minutes for up to 15 hours. For each IPTG 
concentration and the no IPTG control, tetracycline dose response curves were determined when the 
control (absence of tetracycline) reached 0.1 OD^. To compare tetracycline sensitivity with and 
without IPTG, tetracycline IC JOs were determined from the dose response curves (Figs. 3A-B). 

15 Cells with reduced levels of L23 (AS-rplW) showed increased sensitivity to tetracycline (Fig. 2A) 
as compared to cells with reduced levels of the elaD gene product (AS-elaD) (Fig. 2B). Figure 3 
shows a summary bar chart in which the ratios of tetracycline IC 50s determined in the presence of 
IPTG which gives 50% growth inhibition versus tetracycline ICsos determined without IPTG (fold 
increase in tetracycline sensitivity) were plotted. Cells with reduced levels of either L7/L12 

20 (encoded by genes rplL 7 rplJ) or L23 (encoded by the rplW gene) showed increased sensitivity to 
tetracycline (Fig. 3). Cells expressing antisense to genes not known to be involved in protein 
synthesis (AS~atpB/E 9 AS-vwC, AS-e/oD, AS-yohH) did not show the same increased sensitivity to 
tetracycline, validating the specificity of this assay (Fig. 3). 

In addition to the above, it has been observed in initial experiments that clones expressing 

25 antisense RNA to genes involved in protein synthesis (including genes encoding ribosomal proteins 
L7/L12 & L10, L7/L12 alone, L22, and LI 8, as well as genes encoding rRNA and Elongation 
Factor G) have increased sensitivity to the macrolide, erythromycin, whereas clones expressing 
antisense to the non-protein synthesis genes elaD, atpB/E and visC do not Furthermore, the clone 
expressing antisense to rplL and rpU does not show increased sensitivity to nalidixic acid and 

30 ofloxacin, antibiotics which do not inhibit protein synthesis. 

The results with the ribosomal protein genes rplL, rpU 9 and rplW as well as the initial 
results using various other antisense clones and antibiotics show that limiting the concentration of 
an antibiotic target makes cells more sensitive to the antimicrobial agents that specifically interact 
with that protein. The results also show that these cells are sensitized to antimicrobial agents that 

35 inhibit the overall function in which the protein target is involved but are not sensitized to 
antimicrobial agents that inhibit other functions. 
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The cell-based assay described above may also be used to identify the biological pathway 
in which a proliferation-required nucleic acid or its gene product lies. In such methods, cells 
expressing a sub-lethal level of antisense to a target proliferation-required nucleic acid and control 
cells in which expression of the antisense has not been induced are contacted with a panel of 
5 antibiotics known to act in various pathways. If the antibiotic acts in the pathway in which the 
target proliferation-required nucleic acid or its gene product lies, cells in which expression of the 
antisense has been induced will be more sensitive to the antibiotic than cells in which expression of 
the antisense has not been induced. 

As a control, the results of the assay may be confirmed by contacting a panel of cells 

10 expressing antisense nucleic acids to many different proliferation-required genes including the 
target proliferation-required gene. If the antibiotic is acting specifically, heightened sensitivity to 
the antibiotic will be observed only in the cells expressing antisense to a target proliferation- 
required gene (or cells expressing antisense to other proliferation-required genes in the same 
pathway as the target proliferation-required gene) but will not be observed generally in all cells 

1 5 expressing antisense to proliferation-required genes. 

Similarly, the above method may be used to determine the pathway on which a test 
compound, such as a test antibiotic acts. A panel of cells, each of which expresses antisense to a 
proliferation-required nucleic acid in a known pathway, is contacted with a compound for which it 
is desired to determine the pathway on which it acts. The sensitivity of the panel of cells to the test 

20 compound is determined in cells in which expression of the antisense has been induced and in 
control cells in which expression of the antisense has not been induced. If the test compound acts 
on the pathway on which an antisense nucleic acid acts, cells in which expression of the antisense 
has been induced will be more sensitive to the compound than cells in which expression of the 
antisense has not been induced. In addition, control cells in which expression of antisense to 

25 proliferation-required genes in other pathways has been induced will not exhibit heightened 
sensitivity to the compound. In this way, the pathway on which the test compound acts may be 
determined. 

The Example below provides one method for performing such assays. 

EXAMPLE 10 

30 Identification of the Pathway in which a Proliferation-Required 

Gene Lies or the Pathway on which an Antibiotic Acts 
A. Preparation of Bacterial Stocks for Assay 

To provide a consistent source of cells to screen, frozen stocks of host bacteria containing 
the desired antisense construct are prepared using standard microbiological techniques. For 
35 example, a single clone of the organism can be isolated by streaking out a sample of the original 
stock onto an agar plate containing nutrients for cell growth and an antibiotic for which the 
antisense construct contains a gene which confers resistance. After overnight growth an isolated 
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colony is picked from the plate with a sterile needle and transferred to an appropriate liquid growth 
media containing the antibiotic required for maintenance of the plasmid The cells are incubated at 
30°C to 37°C with vigorous shaking for 4 to 6 hours to yield a culture in exponential growth. Sterile 
glycerol is added to 15% (volume to volume) and lOOpL to 500 \xL aliquots are distributed into 
5 sterile cryotubes, snap frozen in liquid nitrogen, and stored at -80°C for future assays. 

B. Growth of Bacteria for Use in the Assay 

A day prior to an assay, a stock vial is removed from the freezer, rapidly thawed (37°C 
water bath) and a loop of culture is streaked out on an agar plate containing nutrients for cell growth 
and an antibiotic to which the antisense construct confers resistance. After overnight growth at 

10 37°C, ten randomly chosen, isolated colonies are transferred from the plate (sterile inoculum loop) 
to a sterile tube containing 5 mL of LB medium containing the antibiotic to which the antisense 
vector confers resistance. After vigorous mixing to form a homogeneous cell suspension, the optical 
density of the suspension is measured at 600 nm (ODeoo) and if necessary an aliquot of the 
suspension is diluted into a second tube of 5 mL, sterile, LB medium plus antibiotic to achieve an 

15 OD 6 oo ^ 0.02 absorbance units. The culture is then incubated at 37° C for 1-2 hrs with shaking until 
the OD600 reaches OD 0.2 - 0.3. At this point the cells are ready to be used in the assay. 

C. Selection of Media to be Used in Assay 

Two fold dilution series of the inducer are generated in culture media containing the 
appropriate antibiotic for maintenance of the antisense construct. Several media are tested side by 

20 side and three to four wells are used to evaluate the effects of the inducer at each concentration in 
each media. For example, M9 minimal media, LB broth, TBD broth and Muller-Hinton media may 
be tested with the inducer IPTG at the following concentrations, 50 fiM, 1 00 ^M, 200 nM, 400 (iM, 
600 jiM, 800 jiM and 1000 pM. Equal volumes of test media-inducer and cells are added to the 
wells of a 384 well microtiter plate and mixed. The cells are prepared as described above and 

25 diluted 1 : 100 in the appropriate media containing the test antibiotic immediately prior to addition to 
the microtiter plate wells. For a control, cells are also added to several wells of each media that do 
not contain inducer, for example 0 \iM IPTG. Cell growth is monitored continuously by incubation 
at 37°C in a microtiter plate reader monitoring the ODeoo of the wells over an 18-hour period. The 
percent inhibition of growth produced by each concentration of inducer is calculated by comparing 

30 the rates of logarithmic growth against that exhibited by cells growing in media without inducer. 
The medium yielding greatest sensitivity to inducer is selected for use in the assays described 
below. 

D. Measurement of Test Antibiotic Sensitivity in the Absence of Antisense Construct Induction 

Two-fold dilution series of antibiotics with a known mechanism of action are generated in 
35 the culture media selected for further assay development that has been supplemented with the 
antibiotic used to maintain the construct. A panel of test antibiotics known to act on different 
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pathways is tested side by side with three to four wells being used to evaluate the effect of a test 
antibiotic on cell growth at each concentration. Equal volumes of test antibiotic and cells are added 
to the wells of a 384 well microtiter plate and mixed. Cells are prepared as described above using 
the media selected for assay development supplemented with the antibiotic required to maintain the 
S antisense construct and are diluted 1:100 in identical media immediately prior to addition to the 
microtiter plate wells. For a control, cells are also added to several wells that contain the solvent 
used to dissolve the antibiotics but no antibiotic. Cell growth is monitored continuously by 
incubation at 37°C in a microtiter plate reader monitoring the ODeoo of the wells over an 18-hour 
period. The percent inhibition of growth produced by each concentration of antibiotic is calculated 
10 by comparing the rates of logarithmic growth against that exhibited by cells growing in media 
without antibiotic. A plot of percent inhibition against Iog[antibiotic concentration] allows 
extrapolation of an IC50 value for each antibiotic. 

E. Measurement of Test Antibiotic Sensitivity in the Presence of Antisense Construct Inducer 

The culture media selected for use in the assay is supplemented with inducer at 

15 concentrations shown to inhibit cell growth by 50% and 80% as described above and the antibiotic 
used to maintain the construct. Two fold dilution series of the panel of test antibiotics used above 
are generated in each of these media Several antibiotics are tested side by side with three to four 
wells being used to evaluate the effects of an antibiotic on cell growth at each concentration, in each 
media. Equal volumes of test antibiotic and cells are added to the wells of a 384 well microtiter 

20 plate and mixed. Cells are prepared as described above using the media selected for use in the 
assay supplemented with the antibiotic required to maintain the antisense construct The cells are 
diluted 1:100 into two 50 mL aliquots of identical media containing concentrations of inducer that 
have been shown to inhibit cell growth by 50% and 80 % respectively and incubated at 37°C with 
shaking for 2.5 hours. Immediately prior to addition to the microtiter plate wells, the cultures are 

25 adjusted to an appropriate OD 6 oo (typically 0.002) by dilution into warm (37°C) sterile media 
supplemented with identical concentrations of the inducer and antibiotic used to maintain the 
antisense construct For a control, cells are also added to several wells that contain solvent used to 
dissolve test antibiotics but which contain no antibiotic. Cell growth is monitored continuously by 
incubation at 37°C in a microtiter plate reader monitoring the ODeoo of the wells over an 18-hour 

30 period. The percent inhibition of growth produced by each concentration of antibiotic is calculated 
by comparing the rates of logarithmic growth against that exhibited by cells growing in media 
without antibiotic. A plot of percent inhibition against logfantibiotic concentration] allows 
extrapolation of an IC50 value for each antibiotic. 

F. Determining the Specificity of the Test Antibiotics 

35 A comparison of the ICsoS generated by antibiotics of known mechanism of action under 

antisense induced and non-induced conditions allows the pathway in which a proliferation-required 
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nucleic acid lies to be identified. If cells expressing an antisense nucleic acid against a 
proliferation-required gene are selectively sensitive to an antibiotic acting via a particular pathway, 
then the gene against which the antisense acts is involved in the pathway in which the antibiotic 
acts. 

5 G. Identification of Pathway in which a Test Antibiotic Acts 

As discussed above, the cell-based assay may also be used to determine the pathway 
against which a test antibiotic acts. In such an analysis, the pathways against which each member 
of a panel of antisense nucleic acids acts are identified as described above. A panel of cells, each 
containing an inducible nucleic acid complementary to a gene in a known proliferation-required 

10 pathway, is contacted with a test antibiotic for which it is desired to determine the pathway on 
which it acts under inducing an non-inducing conditions. If heightened sensitivity is observed in 
induced cells expressing antisense complementary to a gene in a particular pathway but not in 
induced cells expressing antisense complementary to genes in other pathways, then the test 
antibiotic acts against the pathway for which heightened sensitivity was observed. 

15 One skilled in the art will appreciate that further optimization of the assay conditions, such 

as the concentration of inducer used to induce antisense expression and/or the growth conditions 
used for the assay (for example incubation temperature and media components) may further 
increase the selectivity and/or magnitude of the antibiotic sensitization exhibited. 

The following example confirms the effectiveness of the methods described above. 

20 EXAMPLE 11 

Identification of the Pathway in which a Proliferation-Required Gene Lies 
Antibiotics of various chemical classes and modes of action were purchased from Sigma 
Chemicals (St. Louis, MO). Stock solutions were prepared by dissolving each antibiotic in an 
appropriate aqueous solution based on information provided by the manufacturer. The final 

25 working solution of each antibiotic contained no more than 0.2% (w/v) of any organic solvent. To 
determine their potency against a bacterial strain engineered for expression of an antisense 
complementary to a proliferation-required gene encoding SOS ribosomal protein, each antibiotic 
was serially diluted two or three fold in growth medium supplemented with the appropriate 
antibiotic for maintenance of the anti-sense construct. At least ten dilutions were prepared for each 

30 antibiotic. 25 pL aliquots of each dilution were transferred to discrete wells of a 384-well 
microplate (the assay plate) using a multi-channel pipette. Quadruplicate wells were used for each 
dilution of an antibiotic under each treatment condition (plus and minus inducer). Each assay plate 
contained twenty wells for cell growth controls (growth media replacing antibiotic), ten wells for 
each treatment (plus and minus inducer, in this example IPTG). Assay plates were usually divided 

35 into the two treatments: half the plate containing induced cells and an appropriate concentrations of 
inducer (in this example IPTG) to maintain the state of induction, the other half containing non- 
induced cells in the absence of IPTG. 
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Cells for the assay were prepared as follows. Bacterial cells containing a construct, from 
which expression of antisense nucleic acid complementary to rplL and rplF 9 which encode 
proliferation-required 50S ribosomal subunit proteins, is inducible in the presence of IPTG, were 
grown into exponential growth (OD^o 0.2 to 0.3) and then diluted 1:100 into fresh media 
5 containing either 400 pM or 0 jiM inducer (IPTG). These cultures were incubated at 37° C for 2.5 
hr. After a 2.5 hr incubation, induced and non-induced cells were respectively diluted into an assay 
medium at a final OD 6 oo value of 0.0004. The medium contained an appropriate concentration of 
the antibiotic for the maintenance of the anti-sense construct In addition, the medium used to dilute 
induced cells was supplemented with 800 pM IPTG so that addition to the assay plate would result 

10 in a final IPTG concentration of 400 jiM. Induced and non-induced cell suspensions were 
dispensed (25 ^il/well) into the appropriate wells of the assay plate as discussed previously. The 
plate was then loaded into a plate reader, incubated at constant temperature, and cell growth was 
monitored in each well by the measurement of light scattering at 595 nm. Growth was monitored 
every 5 minutes until the cell culture attained a stationary growth phase. For each concentration of 

15 antibiotic, a percentage inhibition of growth was calculated at the time point corresponding to mid- 
exponential growth for the associated control wells (no antibiotic, plus or minus IPTG). For each 
antibiotic and condition (plus or minus IPTG), a plot of percent inhibition versus log of antibiotic 
concentration was generated and the IC 50 determined. A comparison of the IC 5 o for each antibiotic 
in the presence and absence of IPTG revealed whether induction of the antisense construct 

20 sensitized the cell to the mechanism of action exhibited by the antibiotic. Cells which exhibited a 
significant (standard statistical analysis) numerical decrease in the IC50 value in the presence of 
inducer were considered to have an increased sensitivity to the test antibiotic. 

The results are provided in the table below, which lists the classes and names of the 
antibiotics used in the analysis, the targets of the antibiotics, the IC 50 in the absence of IPTG, the 

25 IC50 in the presence of IPTG, the concentration units for the IC50S, the fold increase in IC50 in the 
presence of IPTG, and whether increased sensitivity was observed in the presence of IPTG. 
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The above results demonstrate that induction of an antisense RNA to genes encoding SOS 
ribosomal subunit proteins results in a selective and highly significant sensitization of cells to 
antibiotics that inhibit ribosomal function and protein synthesis. The above results further 
demonstrate that induction of an antisense construct to an essential gene sensitizes an organism to 
5 compounds that interfere with that gene products' biological role. This sensitization is restricted to 
compounds that interfere with pathways associated with the targeted gene and it's product 

Assays utilizing antisense constructs to essential genes can be used to identify compounds 
that specifically interfere with die activity of multiple targets in a pathway. Such constructs can be 
used to simultaneously screen a sample against multiple targets in one pathway in one reaction 

1 0 (Combinatorial HTS). 

Furthermore, as discussed above, panels of antisense construct containing cells may be used 
to characterize the point of intervention of any compound affecting an essential biological pathway 
including antibiotics with no known mechanism of action. 

Another embodiment of the present invention is a method for determining the pathway 

IS against which a test antibiotic compound is active in which the activity of target proteins or nucleic 
acids involved in proliferation-required pathways is reduced by contacting cells with a sub-lethal 
concentration of a known antibiotic which acts against the target protein or nucleic acid. In one 
embodiment, the target protein or nucleic acid is a target protein or nucleic acid corresponding to a 
proliferation-required nucleic acid identified using the methods described above. The method is 

20 similar to those described above for determining which pathway a test antibiotic acts against except 
that rather than reducing the activity or level of a proliferation-required gene product using a sub- 
lethal level of antisense to a proliferation-required nucleic acid, the activity or level of the 
proliferation-required gene product is reduced using sub-lethal level of a known antibiotic which 
acts against the proliferation required gene product 

25 Interactions between drugs which affect the same biological pathway has been described in 

the literature. For example, Mecillinam (Amdinocillin) binds to and inactivates the penicillin 
binding protein 2 (PBP2, product of the mrdA in K coli). This antibiotic inteacts with other 
antibiotics that inhibit PBP2 as well as antibiotics that inhibit other penicillin binding proteins such 
as PBP3 [(Gutmann, L., Vincent, S., Billot-Klein, D., Acar, JJF., Mrena, E., and Williamson, IL 

30 (1986) Involvement of penicillin-binding protein 2 with other penicillin-binding proteins in lysis of 
Escherichia coli by some beta-lactam antibiotics alone and in synergistic lytic effect of 
amdinocillin (mecillinam). Antimicrobial Agents & Chemotherapy, 30:906-912)]. Interactions 
between drugs could, therefore, involve two drugs that inhibit the same target protein or nucleic 
acid or inhibit different proteins or nucleic acids in the same pathway [(Fukuoka, T., Domon, H., 

35 Kakuta, M., Ishii, C, Hirasawa, A, Utsui, Y., Ohya, S., and Yasuda, H. (1997) Combination effect 
between panipenem and vancomycin on highly methicillin-resistant Staphylococcus aureus. Japan. 
J. Antibio. 50:41 1-419; Smith, C.E., Foleno, B.E., Barrett, J.F., and Frosc, M.B. (1997) Assessment 
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of the synergistic interactions of levofloxacin and ampicillin against Enterococcus faecwm by the 
checkerboard agar dilution and time-kill methods. Diagnos. Microbiol. Infect Disease 27:85-92; 
den Hollander, J.G., Horrevorts, A.M ., van Goor, MX., Verbrugh, H.A., and Mouton, J.W. (1997) 
Synergism between tobramycin and ceftazidime against a resistant Pseudomonas aeruginosa strain, 
5 tested in an in vitro pharmacokinetic model. Antimicrobial Agents & Chemotherapy. 41 :95-l 10)]. 

Two drugs may interact even though they inhibit different targets. For example, the proton 
pump inhibitor, Omeprazole, and the antibiotic, Amoxycillin, two synergistic compounds acting 
together, can cure Helicobacter pylori infection [( Gabryelewicz, A., Laszewicz, W., 
Dzieniszewski, J., Ciok, J., Marlicz, K., Bielecki, D., Popiela, T., Legutko, J., Knapik, Z., 
10 Poniewierka, E. (1997) Multicenter evaluation of dual-therapy (omeprazol and amoxycillin) for 
Helicobacter pylori-associated duodenal and gastric ulcer (two years of the observation). J. Physiol. 
Pharmacol. 48 Suppl 4:93-105)]. 

The growth inhibition from the sub-lethal concentration of the known antibiotic may be at 
least about 5%, at least about 8%, at least about 10%, at least about 20%, at least about 30%, at 
1 5 least about 40%, at least about 50%, at least about 60%, or at least about 75%, or more. 

Alternatively, the sub-lethal concentration of the known antibiotic may be determined by 
measuring the activity of the target proliferation-required gene product rather than by measuring 
growth inhibition. 

Cells are contacted with a combination of each member of a panel of known antibiotics at a 

20 sub-lethal level and varying concentrations of the test antibiotic. As a control, the cells are 
contacted with varying concentrations of the test antibiotic alone. The IC 50 of die test antibiotic in 
the presence and absence of the known antibiotic is determined. If the IC50S in the presence and 
absence of the known drug are substantially similar, then the test drug and the known drug act on 
different pathways. If the IC50S are substantially different, then the test drug and the known drug 

25 act on the same pathway. 

Another embodiment of the present invention is a method for identifying a candidate 
compound for use as an antibiotic in which the activity of target proteins or nucleic acids involved 
in proliferation-required pathways is reduced by contacting cells with a sub-lethal concentration of 
a known antibiotic which acts against the target protein or nucleic acid. In one embodiment, the 

30 target protein or nucleic acid is a target protein or nucleic acid corresponding to a proliferation- 
required nucleic acid identified using the methods described above. The method is similar to those 
described above for identifying candidate compounds for use as antibiotics except that rather than 
reducing tbe activity or level of a proliferation-required gene product using a sub-lethal level of 
antisense to a proliferation-required nucleic acid, the activity or level of the proliferation-required 

35 gene product is reduced using a sub-lethal level of a known antibiotic which acts against the 
proliferation required gene product 
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* The growth inhibition from the sub-lethal concentration of the known antibiotic may be at 

least about 5%, at least about 8%, at least about 10%, at least about 20%, at least about 30%, at 
least about 40%, at least about 50%, at least about 60%, or at least about 75%, or more. 

Alternatively, the sub-lethal concentration of the known antibiotic may be determined by 
5 measuring the activity of the target proliferation-required gene product rather than by measuring 
growth inhibition. 

In order to characterize test compounds of interest, cells are contacted with a panel of 
known antibiotics at a sub-lethal level and one or more concentrations of the test compound. As a 
control, the cells are contacted with the same concentrations of the test compound alone. Hie IC 50 

10 of the test compound in the presence and absence of the known antibiotic is determined. If the IC 5 o 
of the test compound is substantially different in the presence and absence of the known drug then 
the test compound is a good candidate for use as an antibiotic. As discussed above, once a 
candidate compound is identified using the above methods its structure may be optimized using 
standard techniques such as combinatorial chemistry. 

15 Representative known antibiotics which may be used in each of the above methods are 

provided in the table below. However, it will be appreciated that other antibiotics may also be used. 



ANTIBIOTIC 


INHIBITS/TARGET 


RESISTANT 
MUTANTS 


Inhibitors of Transcription 






Rifamycin, 1959 


Inhibits initiation oftranscription/D-subunit 


rpoB 9 crp, cyaA 


Rifempicin 


RNA polymerase, rpoB 




Rifabutin 






Rifaximin 






Streptolydigin 


Accelerates transcription chain 


rpoB 




terminal un/fl-subunit RNA polymerase 




Streptovaricin 


an acyclic ansamycin, inhibits RNA 


rpoB 




polymerase 




Actinomycin D+EDTA 


Intercalates between 2 successive G-C 


pldA 




pairs, rpoB, inhibits RNA synthesis 




Inhibitors of Nucleic Acid Metabolism 




Quinolones, 1962 


a subunit gyrase and/or topoisomerase IV, 




Nalidixic acid 


gyrA 


gyrAorB t icd, sloB 


Oxolinic acid 






Fluoroquinolones 


a subunit gyrase, gyrA and/or 


gyrA 


Ciprofloxacin, 1983 


topoisomerase IV (probable target in Staph) 


nor A (efflux in Staph) 


Norfloxacin 




hipQ 


Coumerins 


Inhibits ATPase activity of B-subunit 




Novobiocin 


gyrase, gyrB 


gyrB, cysB, cysE, nov, 






ompA 


Coumermycin 


Inhibits ATPase activity of 6-subunit 


gyrB f hisW 




gyrase, gyrB 




Albicidin 


DNA synthesis 


tsx (nucleoside channel) 


Metronidazole 


Causes single-strand breaks in DNA 


nor 
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Inhibitors of Metabolic Pathways 



Sulfonamides, 1932 

Sulfanilamide 
Trimethoprim, 1962 
Showdomycin ' 



Thiolactomycin 



Psicofuranine 

Triclosan 
Diazoborines 
Isoniazid 
Ethionamide 



blocks synthesis of dihydrofolate,dihydro- 

pteroate synthesis, /o/P 

Inhibits dihydrofolate reductase, folA 

Nucleoside analogue capable of alkylating 

sulfhydryi groups, inhibitor of thymidylate 

synthetase 

type II fatty acid synthase inhibitor 



Adenosine glycoside antibiotic, target is 
GMP synthetase 
Inhibits ratty acid synthesis 
heterocyclic, contains boron, inhibit fatty 
acid synthesis, enoyl-ACP reductase,/^/ 



folP, gpt, pabA, pabB, 
pabC 
folAy thyA 
nupC, pnp 



emrB 

fadB, emrB due to gene 



guaA,B 



. fabl(emM) 
1 fabI(envM) 



Inhibitors of Translation 
Phenylpropanoids 

Chloramphenicol, 1947 



Tetracyclines, 1948, 

type II polyketides 

Minocycline 

Doxycycline 
Macrolides (type I polyketides) 

Erythromycin, 1950 

Carbomycin 

Spiramycin, etc 

Aminoglycosides 
Streptomycin, 1944 
Neomycin 

Spectinomycin 
Kanamycin 

Kasugamycin 

Gentamicin, 1963 

Amikacin 

Paromycin 

Lincosamides 

Lincomycin, 1955 

Clindamycin 
Strepto gram ins 

Virginiamycm, 1955 

Pristinamycm 
Synercid: quinupristin 

/dalfopristin 
Fusidanes 

Fusidic Acid 
Kirromycin 

(Mocimycin) 
Pulvomycin 
TTiiopeptin 

Hamulin 



Binds to ribosomal peptidyl transfer center 
preventing peptide translocation/ binds to 
S6, L3, L6, L14, L16, L25, L26, L27, but 
preferentially to L16 

Binding to 30S ribosomal subunit, U A" site 
on 30S subunit, blocks peptide elongation, 
strongest binding to S7 

Binding to 50 S ribosomal subunit, 23S 
rRNA, blocks peptide translocation, LI 5, 
L4,L12 

Irreversible binding to 3 OS ribosomal 
subunit, prevents translation or causes 
mistranslation of mRNA/16S rRNA 



Binding to 50 S ribosomal subunit, blocks 
peptide translocation 

2 components, Streptogramins A&B, bind 
to the 50S ribosomal subunit blocking 
peptide translocation and peptide bond 
formation 

Inhibition of elongation factor G (EF-G) 
prevents peptide translocation 

Inhibition of elongation factor TU (EF-Tu), 

prevents peptide bond formation 

Binds to and inhibits EF-TU 

Sulfur-containing antibiotic, inhibits protein 
synthesisJEF-G 

Inhibits protein synthesis 



rrn, cmlA, mar A, ompF, 
ompR 

clmA (cmr), mar, ompF 



rrn, rplC, rplD, rplV 
mac 



rpsL, strQM, ubiF 
atpA-E t ecfB, 

hemAC t D,E,G t topA, 
rpsC,D,E t rrn, spcB 
atpA-atpE, cpxA, ecfB, 

hemA,B,L, topA 
ksgA,B,C,D, rp!B,K 

rpslNMR 
rplF, ubiF 
cpxA 
rpsL 

HnB, rpIN.O, rpsG 



JusA 
tofA,B 



rplE 

rplQ rplD 
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Negamycin 

Oxazolidinones 
Linezolid 
Isoniazid 

Nitrofurantoin 



Pseudomonic Acids 
Mupirocin 
(Bactroban) 

Indoknycin 

Viomycin 



Thiopeptides 
Thiostrcpton 
Micrococcin 



Inhibits termination process of protein 

synthesis 
23SrRNA 



Inhibits protein synthesis, nitroreductases 
convert nitrofurantoin to highly reactive 
electrophilic intermediates which attack 
bacterial ribosomal proteins non- 
specifically 

Inhibition of isoleucyl tRNA synthetase- 
used for Staph, topical cream, nasal 
spray 

Inhibits tryptophanyl-tRNA synthetase 



Binds to LI 1-23S RNA complex 
Inhibits GTP hydrolysis by EF-G 
Stimulates GTP hydrolysis by EF-G 



prfB 



pdx 
nfnA,B 



ileS 



trpS 

rrmA (23 S rRNA 



;e; 

mutant has slow 
growth rate, slow 
chain elongation rate, 
and viomycin 
resistance) 



Inhibitors of Cell Walls/Membranes 



fi-lactams 

Penicillin, 1929 
Ampicillin 
Memicillin, 1960 



Cephalosporins, 1962 

Mecillinam 
(amdinocillin) 

Aztreonam 

(Furazlocillin) 
Bacilysin, Tetaine 
Glycopeptides 

Vancomycin, 1955 
Polypeptides 

Bacitracin 
Cyclic lipopeptide 

Daptomycin, 1980 



Cyclic polypeptides 
Polymixin, 1939 
Fosfomycin, 1969 



Inhibition of one or more cell wall 
transpeptidases, endopeptidases, and 

glycosidases (PBPs), of the 12 PBPs only 2 ampC, ampD, ampE, 
are essential: mrdi (PBP2) and ftsl (pbpB, envZ, galU, hipA, 



PBP3) 



Cycloserine 



Binds to and inactivates PBP2 (jnrdA) 

Inactivates PBP3 (ftsl) 

Dipeptide, inhib glucosamine synthase 
Inhib G+ cell wall syn, binds to terminal 
D-ala-D-ala of pentapeptide, 
Prevents dephosphorylation and 
regeneration of lipid carrier 
Disrupts multiple aspects of membrane 
function, including peptidogtycan 
synthesis, hpoteichoic acid synthesis, and 
the bacterial membrane potential 
Surfactant action disrupts cell membrane 
lipids, binds lipid A mioety of LPS 
Analogue of P-enolpyruvate, inhibits 1 st 
step in peptidoglycan synthesis - UDP-N- 
acetylgiucosamine enolpyruvyl 
transferase, murA. Also acts as 
Immunosuppressant 
Prevents formation of D-ala dimer, 
inhibits D-ala ligase, ddlA,B 



hipQ, ompC, ompF, 
ompR, ptsl, rfa, 
tolD, tolE 
tonB 

alaS, argS, crp, cyaA, 
envB, mrdA,B, 
mreB.QD 



dppA 



rfa 



pmrA 



murA, crp, cyaA glpT, 
fiipA, ptsl, uhpT 



hipA, cycA 
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Alafosfalin phosphonodipeptide, cell wall synthesis pepA, tpp 

inhibitor, potentiator of p-lactams 

Inhibitors of Protein Processing/Transport 

Globomycin Inhibits signal peptidase II (cleaves Ipp, dnaE 

prolipoproteins subsequent to lipid 
modification, IspA 



EXAMPLE 12 

Transfer of Exogenous Nucleic Acid Sequences to other Bacterial Species Using the K coli Expression 
5 Vectors or Expression Vectors Functional in Bacterial Species other than E. coll 

Molecule No. EcXA190, encoding a portion of the b3052 gene of Escherichia coli, was 
either transformed directly into Enterobacter cloacae, Salmonella typhimurium and/or Klebsiella 
pneumoniae or subcloned into an expression vector functional in these species and the subclones 
transformed into these organisms. Suitable expression vectors are well known in the art. These 

10 expression vectors were introduced into Enterobacter cloacae, Salmonella typhimurium and/or 
Klebsiella pneumoniae cells that were then assayed for growth inhibition according to the method 
of Example 1. After growth in liquid culture, cells were plated at various serial dilutions and a 
score determined by calculating the log difference in growth for INDUCED vs. UNINDUCED 
antisense RNA expression as determined by the maximum 10 fold dilution at which a colony was 

1 5 observed. If there was no effect of antisense RNA expression in one organism, the clone is given a 
score of zero "0" in that organism. In contrast, a score of "8" means that 10 8 times more cells were 
required to observe a colony formed on the induced state than in the non-induced state under the 
conditions used and in that organism. 

Expression vectors containing Molecule No. EcXA190 were found to inhibit bacterial 

20 growth in all four organisms when expression of the antisense RNA was induced with IPTG. A 
score of 4 was assigned for Escherichia coli, 6 for Enterobacter cloacae, and 8 for Salmonella 
typhimurium and 3 for Klebsiella pneumoniae (obvious additional growth defect as well). The 
protein encoded by this sequence may be used as a target sequence to screen candidate compound 
libraries as described above. 

25 In addition, the above methods were validated using other antisense nucleic acids which 

inhibit the growth of E. coli which were identified using methods similar to those described above. 
Expression vectors which inhibited growth of E. coli upon induction of antisense RNA expression 
with IPTG were transformed directly into Enterobacter cloacae, Klebsiella pneumonia or 
Salmonella typhimurium. The transformed cells were then assayed for growth inhibition according 

30 to the method of Example 1. After growth in liquid culture, cells were plated at various serial 
dilutions and a score determined by calculating the log difference in growth for INDUCED vs. 
UNINDUCED antisense RNA expression as determined by the maximum 10 fold dilution at which 
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a colony was observed. Hie results of these experiments are listed in Table V below. If there was 
no effect of antisense RNA expression in a microorganism, the clone is minus in Table V below. In 
contrast, a positive in Table V below means that at least 10 fold more cells were required to observe 
a colony on the induced plate than on the non-induced plate under the conditions used and in that 
5 microorganism. 

TABLE V 

Sensitivity of Other Microorganisms to Antisense Nucleic Acids That Inhibit Proliferation in K coll 



MoL No. 


S. typhimurium 


£L cloacae 


JL pneumoniae 


EcXAQOl 


+ 


+ 




EcXA004 


+ 






EcXA005 


+ 


+ 


+ ! 


EcXA006 








EcXA007 




+ 




EcXA008 


+ 




+ 


EcXA009 








EcXAOlO 


+ 


+ 


+ 


EcXAOll 




+ 




EcXA0l2 




+ 




EcXA013 


+ 


+ 




EcXA014 








EcXAOlS 




+ 


+ 


EcXA016 


+ 


+ 


. + 


EcXA017 


+ 


+ 


+ 


EcXA018 


+ 




+ 


EcXA019 


+ 


+ 


+ 


EcXA020 


+ 


+ 


+ 


EcXA021 


+ 




+ 


EcXA023 


+ 




+ 


EcXA024 


+ 


_ 


+ 


EcXA025 




_ 


_ 


EcXA026 


+ 


+ 




EcXA027 




+ 




EcXA028 


+ 






EcXA029 








EcXA030 


+ 


+ 


+ 


EcXA031 


+ 






EcXA032 


+ 






EcXA033 




+ 


+ 


EcXA034 


+ 


+ 


+ 


EcXA035 








EcXA036 


+ 




+ 


EcXA037 


+ 


+ 




EcXA038 


+ 




+ 


EcXA039 








EcXA041 




+ 


+ 


EcXA042 
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MoL IX 0. 


£ typhunurium 


E. cloacae 


Ju pneumoniae 










D-YA A/f /I 










T 


i 

T 


i 

T 










I7rYA A/17 


4_ 
T 






Co Y A A/1 0 








C*»Y A A/I Q 


i 

T 






C»YA a^a 








Cr»Y A A^ 1 

bCAAUM 


i 

T 






bcXAUjz 


i 

T 






bCAAU53 


i 

T 


+ 


+ 


EcXAOM 






+ 


ECXA055 


+ 




: 


EcXA056 


+ 




+ 


t?*."v a net 

ECXA057 


+ 


+ 




C^V A ACO 

! bcXAOoo 






: 


bcXA059 


+ 


+ 


+ 


T7_"V" A A^TA 








I7_ V A A<Ct 

bcXAUoi 








bcXAObz 









bcXA0o3 


+ 


+ 




bCAA054 






: 


EcaA065 


+ 


+ 


: 


bcXAOoo 






: 


ECAA067 




+ 




T?_V A A£0 

bcXAOoo 








bcXA0o9 




+ 





r_Y A ATA 

bcXA070 









T7rt"V A AH 1 

bcAAO / 1 








ECXA072 


• 

t 




+ 


ECXA073 


+ 




+ 


A ATA 

bcXA074 


+ 


+ 


+ 


T7 _"V* A ATC 

bcXA073 


+ 






bCAAU70 




+ 




I7*»Y A ATT 

bCXA077 


i 

T 


+ 




17 /»Y A A7Q 


i 

*r 


1 

+ 


+ 


T7oYA AQA 


i 

T 






xiCA/VUoZ 




1 
i 
















T 












HCAAUo / 








EcXA088 








i EcXA089 








EcXA090 








EcXA091 








EcXA092 








EcXA093 
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MoL No. 


S. typhimurium 


K cloacae 


EL pneumoniae 


EcXA094 


+ 


+ 


+ 


EcXA095 


+ 


+ 




EcXA096 








EcXA097 


+ 






EcXA098 


+ 






1 EcXA099 








EcXAlOO 








EcXAlOl 








EcXA102 








EcXA103 




+ 




EcXA104 


+ 


+ 


+ 


EcXA106 


+ 


+ 




EcXA107 








EcXA108 








EcXA109 








EcXAllO 


+ 


+ 




EcXAlll 








EcXA112 








EcXA113 


+ 


+ 


+ 


EcXAlH 




+ 




EcXAllS 




+ 




EcXA116 




+ 




EcXA117 


+ 






EcXA118 








EcXA119 




+ 




EcXA120 








EcXA121 
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Thus, the ability of an antisense nucleic acid which inhibits the proliferation of R coli to 
inhibit the growth of other organims may be evaluated by either transforming the antisense nucleic 
acid directly into other Escherichia species or inserting the antisense nucleic acid into expression 
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vectors that are functional in other Gram negative species such as Enterobacter cloacae, Salmonella 
typhimurhim, and/or Klebsiella pneumoniae . Similarly, the antisense nucleic acid can be inserted 
in expression vectors that are functional in Gram-positive species such as Staphylococcus aureus, 
Enterococcus faecalis and Streptococcus pneumoniae or other species. 
5 Those stalled in the art will appreciate that a negative result in a heterologous microorganism 

does not' mean that that microorganism is missing that gene nor does it mean that the gene is 
unessential. However, a positive result means that the heterologous microorganism contains a 
homologous gene which is required for proliferation of that microorganism. The homologous gene 
may be obtained using the methods described herein. Those cells that are inhibited by antisense may 
10 be used in cell-based assays as described herein for the identification and characterization of 
compounds in order to develop antibiotics effective in these microorganisms. Those skilled in the art 
will appreciate that an antisense molecule which works in the microorganism from which it was 
obtained will not always work in a heterologous microorganism. 

EXAMPLE 13 

15 Use of Identified Exogenous Nucleic Acid Sequences as Probes 

The identified sequence of the present invention can be used as probes to obtain the 
sequence of additional genes of interest from a second organism. For example, probes to genes 
encoding potential bacterial target proteins may be hybridized to nucleic acids from other 
organisms including other bacteria and higher organisms, to identify homologous sequences. Such 

20 hybridization might indicate that the protein encoded by the gene to which the probe corresponds is 
found in humans and therefore not necessarily a good drug target. Alternatively, the gene can be 
conserved only in bacteria and therefore would be a good drug target for a broad spectrum 
antibiotic or antimicrobial. 

Probes derived from the identified nucleic acid sequences of interest or portions thereof can be ' 

25 labeled with detectable labels familiar to those skilled in the art, including radioisotopes and non- 
radioactive labels, to provide a detectable probe. The detectable probe can be single stranded or double 
stranded and can be made using techniques known in the ait, including in vitro transcription, nick 
translation, or kinase reactions. A nucleic acid sample containing a sequence capable of hybridizing to 
the labeled probe is contacted with the labeled probe. If the nucleic acid in the sample is double 

30 stranded, it can be denatured prior to contacting the probe. In some applications, the nucleic acid 
sample can be immobilized on a surface such as a nitrocellulose or nylon membrane. The nucleic acid 
sample can comprise nucleic acids obtained from a variety of sources, including genomic DNA, cDNA 
libraries, RNA, or tissue samples. 

Procedures used to detect the presence of nucleic acids capable of hybridizing to the detectable 

35 probe include well known techniques such as Southern blotting, Northern blotting, dot blotting, colony 
hybridization, and plaque hybridization. In some applications, the nucleic acid capable of hybridizing 
to the labeled probe can be cloned into vectors such as expression vectors, sequencing vectors, or in 
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vitro transcription vectors to facilitate the characterization and expression of the hybridizing nucleic 
acids in the sample. For example, such techniques can be used to isolate, purify and clone sequences 
from a genomic library, made from a variety of bacterial species, which are capable of hybridizing to 
probes made from the sequences identified in Examples 5 and 6. 
5 EXAMPLE 14 

Preparation of PCR Primers and Amplification of PNA 
The identified R coli genes corresponding directly to or located within the operon of nucleic 
acid sequences required for proliferation or portions thereof can be used to prepare PCR primers for a 
variety of applications, including the identification or isolation of homologous sequences from other 

10 species, for example S. typhimurhtm, E. cloacae, E. faecalis, S. pneumoniae, and JC pneumoniae, 
which contain part or all of the homologous genes. Because homologous genes are related but not 
identical in sequence, those skilled in the art will often employ degenerate sequence PCR primers. 
Such degenerate sequence primers are designed based on conserved sequence regions, either known or 
suspected, such as conserved coding regions. The successful production of a PCR product using 

15 degenerate probes generated from the sequences identified herein would indicate the presence of a 
homologous gene sequence in the species being screened. The PCR primers are at least 10 nucleotides, 
and preferably at least 20 nucleotides in length. More preferably, the PCR primers are at least 20-30 
nucleotides in length. In some embodiments, the PCR primers can be more than 30 nucleotides in 
length. It is preferred that the primer pairs have approximately the same G/C ratio, so that melting 

20 temperatures are approximately the same. A variety of PCR techniques are familiar to those skilled in 
the art For a review of PCR technology, see Molecular Cloning to Genetic Engineering White, B.A. 
Ed. in Methods in Molecular Biology 67: Humana Press, Totowa 1997. When the entire coding 
sequence of the target gene is known, the 5' and 3' regions of the target gene can be used as the 
sequence source for PCR probe generation. In each of these PCR procedures, PCR primers on either 

25 side of the nucleic acid sequences to be amplified are added to a suitably prepared nucleic acid sample 
along with dNTPs and a thermostable polymerase such as Taq polymerase, Pfii polymerase, or Vent 
polymerase. Hie nucleic acid in the sample is denatured and the PCR primers are specifically 
hybridized to complementary nucleic acid sequences in the sample. The hybridized primers are 
extended. Thereafter, another cycle of denaturation, hybridization, and extension is initiated. The 

30 cycles are repeated multiple times to produce an amplified fragment containing the nucleic acid 
sequence between the primer sites. 

EXAMPLE 15 
Inverse PCR 

The technique of inverse polymerase chain reaction can be used to extend the known nucleic 
35 acid sequence identified in Examples 5 and 6. The inverse PCR reaction is described generally by 
Ochman et al., in Ch. 10 of PCR Technology: Principles and Applications for DNA Amplification, 
(Henry A. Erlich, Ed.) W.H. Freeman and Co. (1992). Traditional PCR requires two primers that are 
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used to prime the synthesis of complementary strands of DNA. In inverse PGR, only a core sequence 
need be known. 

Using the sequences identified as relevant from the techniques taught in Examples 5 and 6 and 
applied to other species of bacteria, a subset of exogenous nucleic sequences are identified that 
5 correspond to genes or operons that are required for bacterial proliferation. In species for which a 
genome sequence is not known, the technique of inverse PCR provides a method for obtaining the gene 
in order to determine the sequence or to place the probe sequences in full context to the target sequence 
to which the identified exogenous nucleic acid sequence binds. 

To practice this technique, the genome of the target organism is digested with an appropriate 
1 0 restriction enzyme so as to create fragments of nucleic acid that contain the identified sequence as well 
as unknown sequences that flank the identified sequence. These fragments are then circularized and 
become the template for the PCR reaction. PCR primers are designed in accordance with the teachings 
of Example 15 and directed to the ends of the identified sequence. The primers direct nucleic acid 
synthesis away from the known sequence and toward the unknown sequence contained within the 
15 circularized template. After the PCR reaction is complete, the resulting PCR products can be 
sequenced so as to extend the sequence of the identified gene past the core sequence of the identified 
exogenous nucleic acid sequence identified. This process can be repeated iterarivery if necessary. In 
this manner, the fiiU sequence of each novel gene can be identified. Additionally the sequences of. 
adjacent coding and noncoding regions can be identified. 
20 EXAMPLE 16 

Identification of Genes Required for Staphylococcus aureus Proliferation 
Genes required for proliferation in Staphylococcus aureus are identified according to the 
methods described above. 

EXAMPLE 17 

25 Identification of Genes Required for Neisseria gonorrhoeae Proliferation 

Genes required for proliferation in Neisseria gonorrhoeae are identified according to the 
methods described above. 

EXAMPLE 18 

Identification of Genes Required for Pseudomonas aeruginosa Proliferation 
30 Genes required for proliferation in Pseudomonas aeruginosa are identified according to the 

methods described above. 

EXAMPLE 19 

Identification of Genes Required for Enterococcus faecalis Proliferation 
Genes required for proliferation in Enterococcus faecalis are identified according to the 
35 methods described above. 
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EXAMPLE 20 

Identification of Genes Required for Haemophilus influenzae Proliferation 
Genes required for proliferation in Haemophilus influenzae are identified according to the 
methods described above. 
5 EXAMPLE 21 

Identification of Genes Required for Salmonella tvphimwium Proliferation 
Genes required for proliferation in Salmonella typhimurivm are identified according to the 
methods described above. 

EXAMPLE 22 

10 Identification of Genes Required for Helicobacter pylori Proliferation 

Genes required for proliferation in Helicobacter pylori are identified according to the 
methods described above. 

EXAMPLE 23 

Identification of Genes Required for Mycoplasma pneumoniae Proliferation 
15 Genes required for proliferation in Mycoplasma pneumoniae are identified according to the 

methods described above. 

EXAMPLE 24 

Identification of Genes Required for Plasmodium ovale Proliferation 
Genes required for proliferation in Plasmodium ovale are identified according to the methods 
20 described above. 

EXAMPLE 25 

Identification of Genes Required for Saccharomyces cerevisiae Proliferation 
Genes required for proliferation in Saccharomyces cerevisiae are identified according to the 
methods described above. 
25 EXAMPLE 26 

Identification of Genes Required for Entamoeba histolytica Proliferation 
Genes required for proliferation in Entamoeba histolytica are identified according to the 
methods described above. 

EXAMPLE 27 

30 Identification of Genes Required for Candida albicans Proliferation 

Genes required for proliferation in Candida albicans are identified according to the methods 
described above. 

EXAMPLE 28 

Identification of Genes Required for Klebsiella pneumoniae Proliferation 
35 Genes required for proliferation in Klebsiella pneumoniae are identified according to the 

methods described above. 
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EXAMPLE 29 

Identification of Genes Required for Salmonella typhi Proliferation 
Genes required for proliferation in Salmonella typhi are identified according to the methods 
described above. 
5 EXAMPLE 30 

Identification of Genes Required for Salmonella paratyphi Proliferation 
Genes required for proliferation in Salmonella paratyphi are identified according to the 
methods described above. 

EXAMPLE 31 

10 Identification of Genes Required for Salmonella cholerasuis Proliferation 

Genes required for proliferation in Salmonella cholerasuis are identified according to the 
methods described above. 

EXAMPLE 32 

Identification of Genes Required for Staphylococcus epidermis Proliferation 
15 Genes required for proliferation in Staphylococcus epidermis are identified according to the 

methods described above. 

EXAMPLE 33 

Identification of Genes Required for Mycobacterium tuberculosis Proliferation 
Genes required for proliferation in Mycobacterium tuberculosis are identified according to the 
20 methods described above. 

EXAMPLE 34 

Identification of Genes Required for Mycobacterium leprae Proliferation 
Genes required for proliferation in Mycobacterium leprae are identified according to the 
methods described above. 
25 EXAMPLE 35 

Identification of Genes Required for Treponema pallidum Proliferation 
Genes required for proliferation in Treponema pallidum are identified according to the 
methods described above. 

EXAMPLE 36 

30 Identification of Genes Required for Bacillus anthracis Proliferation 

Genes required for proliferation in Bacillus anthracis are identified, according to the methods 
described above. 

EXAMPLE 37 

Identification of Genes Required for Yersinia pestis Proliferation 
35 Genes required for proliferation in Yersinia pestis are identified according to the methods 

described above. 
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EXAMPLE 38 

Identification of Genes Required for Clostridium botulmum Proliferation 
Genes required for proliferation in Clostridium botulmum are identified according to the 
methods described above. 
5 EXAMPLE 39 

Identification of Genes Required for Campylobacter jejuni Proliferation 
Genes required for proliferation in Campylobacter jejuni are identified according to the 
methods described above. 

EXAMPLE 40 

10 Identification of Genes Required for Chlamydia trachomatis Proliferation 

Genes required for proliferation in Chlamydia trachomatis are identified according to the 
methods described above. 



It will be appreciated that genes required for proliferation of any microorganism of interest, 
15 including those specifically mentioned herein, may be identified according to the methods described 
above. 

Use of Isolated Exogenous Nucleic Acid Fragments as Antisense Antibiotics 

In addition to using the identified sequences to enable screening of molecule libraries to 
identify compounds useful to identify antibiotics, the sequences themselves can be used as therapeutic 

20 agents. Specifically, the identified exogenous sequences in an antisense orientation can be provided to 
an individual to inhibit the translation of a bacterial target gene. 

Generation of Antisense Therapeutics from Identified Exogenous Sequences 
The sequences of the present invention can be used as antisense therapeutics for the 
treatment of bacterial infections or simply for inhibition of bacterial growth in vitro or in vivo. The 

25 therapy exploits the biological process in cells where genes are transcribed into messenger RNA 
(mRNA) that is then translated into proteins. Antisense RNA technology contemplates the use of 
antisense oligonucleotides complementary to a target gene that will bind to its target nucleic acid 
and decrease or inhibit the expression of tie target gene. For example, the antisense nucleic acid 
may inhibit the translation or transcription of the target nucleic acid. In one embodiment, antisense 

30 oligonucleotides can be used to treat and control a bacterial infection of a cell culture containing a 
population of desired cells contaminated with bacteria. In another embodiment, the antisense 
oligonucleotides can be used to treat an organism with a bacterial infection. 

Antisense oligonucleotides can be synthesized from any of the sequences of the present 
invention using methods well known in the art In a preferred embodiment, antisense 

35 oligonucleotides are synthesized using artificial means. Uhlmann & Peymann, Chemical Rev. 
90:543-584 (1990) review antisense oligonucleotide technology in detail. Modified or unmodified 
antisense oligonucleotides can be used as therapeutic agents. Modified antisense oligonucleotides 



-84- 



WO 01/48209 PCT/US00/34419 

are preferred since it is well known that antisense oligonucleotides are extremely unstable. 
Modification of the phosphate backbones of the antisense oligonucleotides can be achieved by 
substituting the internucleotide phosphate residues with methylphosphonates, phosphorothioates, 
phosphoramidates, and phosphate esters. Nonphosphate internucleotide analogs such as siloxane 
5 bridges, carbonate brides, thioester bridges, as well as many others known in the art may also be 
used. The preparation of certain antisense oligonucleotides with modified internucleotide linkages 
is described in U.S. Patent No. 5,142,047. 

Modifications to the nucleoside units of the antisense oligonucleotides are also 
contemplated. These modifications can increase the half-life and increase cellular rates of uptake 

10 for the oligonucleotides in vivo. For example, a-anomeric nucleotide units and modified 
nucleotides such as 1,2-dideoxy-d-ribofuranose, 1,2-dideoxy-l-phenylribofuranose, and iV 4 , JV 4 - 
ethano-5-methyl-cytosine are contemplated for use in the present invention. 

An additional form of modified antisense molecules is found in peptide nucleic acids. 
Peptide nucleic acids (FN A) have been developed to hybridize to single and double stranded nucleic 

15 acids. PNA are nucleic acid analogs in which the entire deoxyribose-phosphate backbone has been 
exchanged with a chemically completely different, but structurally homologous, polyamide (peptide) 
backbone containing 2-aminoethyl glycine units. Unlike DNA, which is highly negatively charged, the 
PNA backbone is neutral. Therefore, there is much less repulsive energy between complementary 
strands in a PNA-DNA hybrid than in the comparable DNA-DNA hybrid, and consequently they are 

20 much more stable. PNA can hybridize to DNA in either a Watson/Crick or Hoogsteen fashion 
(Demidov et al., Proc. Natl. Acad Sci. U.SjL 92:2637-2641, 1995; Egholm, Nature 365:566-568, 
1993;NieIsenetal.,&/ence 254:1497-1500, 1991;Dueholm etal. 9 NewJ. Chem. 21:19-31, 1997). 

Molecules called PNA "clamps" have been synthesized which have two identical PNA 
sequences joined by a flexible hairpin linker containing three 8-amino-3,6-dioxaoctanoic acid units. 

25 When a PNA clamp is mixed with a complementary homopurine or homopyrimidine DNA target 
sequence, a PNA-DNA-PNA triplex hybrid can form which has been shown to be extremely stable 
(Bentin et al., Biochemistry 35:8863-8869, 1996; Egholm et aL, Nucleic Acids Res. 23:217-222, 1995; 
Griffith etal., J. Am. Chem. Soc. 117:831-832, 1995). 

The sequence-specific and high affinity duplex and triplex binding of PNA have been 

30 extensively described (Nielsen et al., Science 254:1497-1 500, 1991; Egholm et al., J. Ant Chem. Soc 
114:9677-9678, 1992; Egholm et al., Nature 365:566-568, 1993; Almarsson et al., Proc. Nad. Acad 
Sci. U.&A 90:9542-9546, 1993; Demidov et al., Proc. Natl. Acad Sci. USA. 92:2637-2641, 1995). 
They have also been shown to be resistant to nuclease and protease digestion (Demidov et al., 
Biochem. Pharm. 48:1010-13 13, 1994). PNA has been used to inhibit gene expression (Hanvey et al., 

35 Science 258:1481-1485,1992; Nielsen et al., Nucl. Acids. Res., 21:197-200, 1993; Nielsen et al., Gene 
149:139-145, 1994; Good & Nielsen, Science, 95: 2073-2076, 1998), to block restriction enzyme 
activity (Nielsen et al., supra., 1993), to act as an artificial transcription promoter (Mollegaaid, Proc. 
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Natl Acad Set U.S.A 91:3892-3895, 1994) and as a pseudo restriction endonuclease (Demidov et al., 
Nucl Acids. Res 212103-2107, 1993). Recently, PNA has also been shown to have antiviral and 
antitumoral activity mediated through an antisense mechanism (Norton, Nature Biotechnol, 14:615- 
619, 1996; Hirschman et al., J. Jmestig. Med. 44:347-351, 1996). PNAs have been linked to various 
5 peptides in order to promote PNA entry into cells (Basu et al., Bioconj. Chem. 8:481-488, 1997; 
Pardridge et al., Proc. Natl. Acad. Sci USA. 92:5592-5596, 1995). 

The antisense oligonucleotides contemplated by the present invention can be administered 
by direct application of oligonucleotides to a target using standard techniques well known in the art 
The antisense oligonucleotides can be generated within the target using a plasmid, or a phage. 

10 Alternatively, the antisense nucleic acid may be expressed from a sequence in the chromosome of 
the target cell. For example, a promoter may be introduced into the chromosme of the target cell 
near the target gene such that the promoter directs teh transcription of the antisense nucleic acid. 
Alternatively, a nucleic acid containing the antisense sequence operably linked to a promoter may 
be introduced into the chromosome of the target cell. It is further contemplated that contemplated 

1 5 that the antisense oligonucleotide contemplated are incorporated in a ribozyme sequence to enable the 
antisense to specifically bind and cleave its target mKNA. For technical applications of ribozyme and 
antisense oligonucleotides see Rossi et al., Pharmacol. Ther. 50(2):245-254, (1991). The present 
invention also contemplates using a retron to introduce an antisense oligonucleotide to a cell. 
Retron technology is exemplified by U.S. Patent No. 5,405,775. Antisense oligonucleotides can 

20 also be delivered using liposomes or by electroporation techniques which are well known in the art 
The antisense nucleic acids of the present invention can also be used to design antibiotic 
compounds comprising nucleic acids which function by intracellular triple helix formation. Triple 
helix oligonucleotides are used to inhibit transcription from a genome. The sequences identified as 
required for proliferation in the present invention, or portions thereof, can be used as templates to 

25 inhibit microorganism gene expression in individuals infected with such organisms. Traditionally, 
homopurine sequences were considered the most useful for triple helix strategies. However, 
homopyrimidine sequences can also inhibit gene expression. Such homopyrimidine oligonucleotides 
bind to the major groove at homopurine:homopyrimidine sequences. Thus, both types of sequences 
based on the sequences of the present invention that are required for proliferation are contemplated for 

30 use as antibiotic compound templates. 

The antisense oligonucleotides of this example employ the identified sequences of the 
present invention to induce bacterial cell death or at least bacterial stasis by inhibiting target 
nucleic acid transcription or translation. Antisense oligonucleotides containing from about 8 to 40 
nucleotides of the sequences of the present invention have sufficient complementary to form a 

35 duplex with the target sequence under physiological conditions. 

To kill bacterial cells or inhibit their growth, the antisense oligonucleotides are applied to 
the bacteria or to the target cells under conditions that facilitate their uptake. These conditions 
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include sufficient incubation times of cells and oligonucleotides so that the antisense 
oligonucleotides are taken up by die cells. In one embodiment, an incubation period of 7-10 days is 
sufficient to kill bacteria in a sample. An optimum concentration of antisense oligonucleotides is 
selected for use. 

5 The concentration of antisense oligonucleotides to be used can vary depending on the type 

of bacteria sought to be controlled, the nature of the antisense oligonucleotide to be used, and the 
relative toxicity of the antisense oligonucleotide to the desired cells in the treated culture. 
Antisense oligonucleotides can be introduced to cell samples at a number of different concentrations 
preferably between lxlO' 10 M to lxlO^M. Once the minimum concentration that can adequately 

10 control gene expression is identified, the optimized dose is translated into a dosage suitable for use in 
vivo. For example, an inhibiting concentration in culture of lxlO" 7 translates into a dose of 
approximately 0.6 mg/kg body weight Levels of oligonucleotide approaching 1 00 mg/kg body weight 
or higher may be possible after testing the toxicity of the oligonucleotide in laboratory animals. It is 
additionally contemplated that cells from the subject are removed, treated with the antisense 

1 5 oligonucleotide, and reintroduced into the subject. This range is merely illustrative and one of skill in 
the art are able to determine the optimal concentration to be used in a given case. 

After the bacterial cells have been killed or controlled in a desired culture, the desired cell 
population may be used for other purposes. 

EXAMPLE 41 

20 The following example demonstrates the ability of an £ coli antisense oligonucleotide to 

act as a bactericidal or bacteriostatic agent to treat a contaminated cell culture system. The 
application of the antisense oligonucleotides of the present invention are thought to inhibit the 
translation of bacterial gene products required for proliferation. 

The antisense oligonucleotide of this example corresponds to a 30 base phophorothioate 

25 modified oligodeoxynucelotide complementary to a nucleic acid involved in proliferation, such as 
Molecule Number EcXAl 18 (SEQ ID NO: 1). A sense oligodeoxynucelotide complementary to 
the antisense sequence is synthesized and used as a control. The oligonucleotides are synthesized 
and purified according to the procedures of Matsukura, et al., Gene 72:343 (1988). The test 
oligonucleotides are dissolved in a small volume of autoclaved water and added to culture medium 

30 to make a 100 micromolar stock solution. 

Human bone marrow cells are obtained from the peripheral blood of two patients and 
cultured according standard procedures well known in the art The culture is contaminated with the 
K-12 strain of £ coli and incubated at 37°C overnight to establish bacterial infection. 

The control and antisense oligonucleotide containing solutions are added to the 

35 contaminated cultures and monitored for bacterial growth. After a 10 hour incubation of culture 
and oligonucleotides, samples from the control and experimental cultures are drawn and analyzed 
for the translation of the target bacterial gene using standard microbiological techniques well 
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known in the art The target E. coli gene is found to be translated in the control culture treated with 
the control oligonucleotide, however, translation of the target gene in the experimental culture 
treated with the antisense oligonucleotide of the present invention is not detected or reduced. 

One way to determine if a gene is essential for proliferation in a host or virulence in a host is 
5 to construct a conditional allele of the gene an infectious organism. The host is then challenged with 
the organism under conditions in which the product of the gene is functional or non-functional or has 
reduced activity or where the gene product is absent or else present but at a reduced level. If the gene is 
essential for proliferation or virulence the infection of the host will be diminished or abolished under 
conditions in which the product of die gene is not functional or has reduced activity or where the gene 

1 0 product is absent or else present but at a reduced level. 

Since expression of an antisense nucleic acid complementary to a gene required for 
proliferation also decreases the synthesis of the gene product, antisense nucleic acids may also be used 
to evaluate whether a gene is essential for the proliferation or virulence of an infectious organism in the 
host In such methods, nucleic acids encoding an antisense molecule complementary to the desired 

1 5 target gene are introduced into the infectious organism. For exampe, plasmids comprising one of SEQ 
ID NOs: 1-93 or fragments thereof which inhibit proliferation, may be introduced into the infectious 
organism. In some embodiments, the antisense nucleic acid may be transcribed from the BPTG- 
inducible promoter in pLEXSBA or from other regulated promoters or vector systems. 

E. coli is transformed with the nucleic acid encoding the antisense molecule by electroporation 

20 and grown in medium which selects for the presence of the vector from which the antisense nucleic 
acid is expressed. The essentiality of the target for each antisense nucleic acid is verified in 
microorganisms grown in culture using the techniques described herein. 

The ability of antisense expression to block K coli infection in an animal is tested using the 
rabbit model of bacterial meningitis. A spinal needle is surgically placed into the cisterna magna of 

25 New Zealand White rabbits. The rabbits are inoculated with 10 5 to 10 6 cells of a normally virulent E. 
coli strain expressing an antisense nucleic acid complementary to a gene required for proliferation. 
Repeated CSF sampling is undertaken to determine multiple parameters of injury and infection such as 
cytochemical abnormalities, intracranial pressure, cerebral edema, BBB permeability, cerebral 
perfusion pressure and recovery of viable £ coli cells. Control animals are given intravenous injections 

30 of saline, which will not induce expression of the antisense nucleic acid, while experimental animals 
are given IPTG in intravenous injections to induce expression of die antisense nucleic acid. 
Alternatively, expression of the antisense nucleic acid may be induced by intravenous infusion of BPTG 
at sub-toxic levels. If other promoters other than IPTG inducible promoters are used, the rabbits may 
be fed the inducer in their water. 

35 Hie use of rabbits allows multiple CSF samples per animal (one rabbit can give up to 8 

sequential samples without change in CSF pressure). Treated animals receive therapy from 2 hours 
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post receiving inoculation up to several days. A typical efficacy study consists of 3 control animals and 
3 treated animals. 

The control animals in which expression of the antisense nucliec acid is not induced are not 
protected against infection with R coli and there is a logarithmic increase in viable bacteria. In 
5 experimental animals, K coli cells recovered from the site of infection are viable until antisense 
expression is subsequently induced. However, if the antisense nucleic acid is directed against a gene 
required for proliferation, after treatment with the inducer for antisense expression the E. coli cells 
infecting these rabbits will not multiply and fewer viable cells will be recovered from the site of 
infection. The K coli cells recovered from die rabbits treated with the inducer are recovered, if still 
10 present, and assayed as above to determine if the promoter and gene are still present and functional 
Conversely, if the antisense nucleic acid is not complementary to a gene required for proliferation, 
treatment of the rabbits with inducer will have no effect on R coli viability. 

EXAMPLE 42 

A subject suffering from an R coli infection is treated with the antisense oligonucleotide 
15 preparation of Example 39. The antisense oligonucleotide is provided in a pharmaceutically 
acceptable carrier at a concentration effective to inhibit the transcription or translation of the target 
nucleic acid. The present subject is treated with a concentration of antisense oligonucleotide 
sufficient to achieve a blood concentration of about 100 micromolar. The patient receives daily , 
injections of antisense oligonucleotide to maintain this concentration for a period of 1 week. At the 
20 end of the week a blood sample is drawn and analyzed for the presence or absence of the orgranism 
using standard techniques well known in the art. There is no detectable evidence of E. coli and the 
treatment is terminated. 

EXAMPLE 43 
Preparation and use of Triple Helix Probes 
25 The sequences of microorganism genes required for proliferation of the present invention are 

scanned to identify 10-mer to 20-mer homopyrimidine or homopurine stretches that could be used in 
triple-helix based strategies for inhibiting gene expression. Following identification of candidate 
homopyrimidine or homopurine stretches, their efficiency in inhibiting gene expression is assessed by 
introducing varying amounts of oligonucleotides containing the candidate sequences into a population 
30 of bacterial cells that normally express the target gene. The oligonucleotides may be prepared on an 
oligonucleotide synthesizer or they may be purchased commercially from a company specializing in 
custom oligonucleotide synthesis, such as GENSET, Paris, France. 

The oligonucleotides can be introduced into the cells using a variety of methods known to 
those skilled in the art, including but not limited to calcium phosphate precipitation, DEAE-Dextran, 
35 electroporation, liposome-mediated transfection or native uptake. 

Treated cells are monitored for a reduction in proliferation using techniques such as 
monitoring growth levels as compared to untreated cells using optical density measurements. Hie 
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oligonucleotides that are effective in inhibiting gene expression in cultured cells can then be introduced 
m vivo using the techniques well known in that art at a dosage level shown to be effective. 

In some embodiments, the natural (beta) anomers of the oligonucleotide units can be replaced 
with alpha anomers to render the oligonucleotide more resistant to nucleases. Further, an intercalating 
5 agent such as ethidium bromide, or the like, can be attached to the 3' end of the alpha oligonucleotide 
to stabilize the triple helix. For information on the generation of oligonucleotides suitable for triple 
helix formation see Griffin et al. (Science 245:967-971 (1989)). 

EXAMPLE 44 

Identification of Bacterial Strains from Isolated Specimens by PCR 
10 Classical bacteriological methods for the detection of various bacterial species are time 

consuming and costly. These methods include growing the bacteria isolated from a subject in 
specialized media, cultivation on selective agar media, followed by a set of confirmation assays that 
can take from 8 to 10 days or longer to complete. Use of the identified sequences of the present 
invention provides a method to dramatically reduce the time necessary to detect and identify 
1 5 specific bacterial species present in a sample. 

In one exemplary method, bacteria are grown in enriched media and DNA samples are isolated 
from specimens of, for example, blood, urine, stool, saliva or central nervous system fluid by 
conventional methods. A panel of PCR primers based on identified sequences unique to various 
species of microorganisms are then utilized in accordance with Example 12 to amplify DNA of 
20 approximately 100-200 nucleotides in length from the specimen. A separate PCR reaction is set up for . 
each pair of PCR primers and after the PCR reaction is complete, the reaction mixtures are assayed for 
the presence of PCR product The presence or absence of bacteria from the species to which the PCR 
primer pairs belong is determined by the presence or absence of a PCR product in die various test PCR 
reaction tubes. 

25 Although the PCR reaction is used to assay the isolated sample for the presence of various 

bacterial species, other assays such as the Southern blot hybridization are also contemplated. 
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WHAT IS CLAIMED IS : 

1. A purified or isolated nucleic acid sequence consisting essentially of one the 
sequence of nucleotides of SEQ ID NOs: 1-93, wherein expression of said nucleic acid in a 
microorganism is capable of inhibiting proliferation of a microorganism. 
5 2. The nucleic acid sequence of Claim 1, wherein said nucleic acid sequence is 

complementary to at least a portion of the nucleotide sequence of the coding strand of a gene whose 
expression is required for proliferation of a microorganism. 

3. Hie nucleic acid of Claim 1, wherein said nucleic acid sequence has a nucleotide 
seuqence complementary to at least a portion of the nucleotide sequence of an RNA required for 

10 proliferation of a microorganism. 

4. The nucleic acid of Claim 3, wherein the nucleotide sequence of said RNA encodes 
more than one gene product 

5. A purified or isolated nucleic acid comprising a fragment of one of the nucleotide 
seuqence of SEQ ID NOs.: 1-93, said fragment selected from the group consisting of fragments 

15 comprising at least 10, at least 20, at least 25, at least 30, at least 50 and more than 50 consecutive 
nucleotides of one of the nucleotide sequences of SEQ ID NOs: 1-93. 

6. A vector comprising a promoter operably linked to the nucleic acid sequence of 
Claims 1,2,3,4, or 5. 

7. The vector of Claim 6, wherein said promoter is active in a microorganism selected 
20 from the group consisting of Aspergillus fumigatus, Bacillus anthracis, Burkholderia cepacia, 

Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata),, 
Candida tropicalis, Candida parapsilosis, Candida guilliemwndii, Candida krusei, Candida kefyr 
(also called Candida pseudotropiccdis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia * 
trachomatus, Clostridium botulinum, Clostridium difficile, Cryptococcus neoformans, 

25 Enterobacter cloacae, Enterococcus faecalis, Escherichia coli, Haemophilus influenzae, 
Helicobacter pylori, Klebsiella pneumoniae, Listeria monocytogenes, Mycobacterium leprae, 
Mycobacteriwn tuberculosis, Neisseria gonorrhoeae, Pseudomonas aeruginosa, Salmonella 
cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Staphylococcus aureus, Klebsiella pneumoniae, Listeria monocytogenes, Moxarella 

30 catarrhalis, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, Pseudomonas 
aeruginosa, Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema pallidum, 
Yersinia pestis and any species falling within the genera of any of the above species. 

8. A host cell containing the vector of Claim 6. 

9. A purified or isolated nucleic acid consisting essentially of the coding sequence of 
35 one of SEQ ID NOs: 106-1 12, 1 19-122, 134-160, 164-171, 179-265, 271-273, 275, and 279-286. 
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10. A fragment of the nucleic acid of Claim 8, said fragment comprising at least 1 0, at 
least 20, at least 25, at least 30, at least 50 or more than 50 consecutive nucleotides of one of SEQ 
IDNOs: 106-112, 119-122, 134-160, 164-171, 179-265, 271-273, 275, and 279-286. 

11. A vector comprising a promoter operably linked to the nucleic acid of Claim 9 or 
5 Claim 10. 

12. A purified or isolated antisense nucleic acid comprising a nucleic acid sequence 
complementary to at least a portion of an intragenic sequence, intergenic sequence, sequences 
spanning at least a portion of two or more genes, 5* noncoding region, or 3' noncoding region 
within an operon comprising a proliferation-required gene whose activity or expression is inhibited 

10 by an antisense nucleic acid comprising one of SEQ IDNOs.: 1-93. 

13. A purified or isolated nucleic acid comprising a nucleic acid having at least 70% 
identity to a sequence selected from the group consisting of SEQ ED NOs.: 1-93, fragments 
comprising at least 25 consecutive nucleotides of SEQ ID NOs.: 1-93, the sequences 
complementary to SEQ ID NOs.: 1-93 and the sequences complementary to fragments comprising 

15 at least 25 consecutive nucleotides of SEQ ID NOs.: 1-93 as determined using BLASTN version 
2.0 with the default parameters. 

14. The nucleic acid of Claim 13, wherein said nucleic acid is from an organism selected 
from the group consisting of Aspergillus fumigatus, Bacillus anthracis, Burkholderia cepacia, 
Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), 

20 Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kejyr 
(also ccdted Candida pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia 
trachomatus, Clostridium botulinum, Clostridium difficile, Cryptococcus neoformans, 
Enterobacter cloacae, Enterococcus faecalis, Escherichia coli, Haemophilus influenzae, 
Helicobacter pylori, Klebsiella pneumoniae, Listeria monocytogenes, Mycobacterium leprae, 

25 Mycobacterium tuberculosis, Neisseria gonorrhoeae, Pseudomonas aeruginosa, Salmonella 
cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Staphylococcus aureus, Klebsiella pneumoniae, Listeria monocytogenes, Moxarella 
catarrhalis, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, Pseudomonas 
aeruginosa, Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema pallidum, 

30 Yersinia pestis and any species falling within the genera of any of the above species. 

15. A vector comprising a promoter operably linked to a nucleic acid encoding a 
polypeptide whose expression is inhibited by an antisense nucleic acid comprising one of SEQ ID 
NOs.: 1-93. 

16. A host cell containing the vector of Claim 15. 

35 17. The vector of Claim 15, wherein said polypeptide comprises a polypeptide 

comprising a sequence selected from the group consisting of SEQ ID NOs: 299-305, 312-315, 327- 
353, 357-364, 372-458, 464-466, 468 and 472-479. 
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18. A purified or isolated polypeptide comprising a polypeptide whose expression is 
inhibited by an antisense nucleic acid comprising one of SEQ ID NOs.: 1-93, or a fragment selected 
from the group consisting of fragments comprising at least 5, at least 10, at least 20, at least 30, at 
least 40, at least 50, at least 60 or more than 60 consecutive amino acids of one of the said 

5 polypeptides. 

19. The polypeptide of Claim 18, wherein said polypeptide comprises a polypeptide 
comprising one of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 464-466, 468 and 
472-479 or a fragment comprising at least 5, at least 10, at least 20, at least 30, at least 40, at least 
50, at least 60 or more than 60 consecutive amino acids of a polypeptide comprising a sequence 

10 selected from the group consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 
464-466, 468 and 472-479. 

20. A purified or isolated polypeptide comprising a polypeptide having at least 25% 
identity to a polypeptide whose expression is inhibited by a sequence selected from the group 
consisting of SEQ ID NOs.: 1-93, or at least 25% identity to a fragment comprising at least 5, at 

15 least 10, at least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 consecutive 
amino acids of a polypeptide whose expression is inhibited by a nucleic acid selected from the 
group consisting of SEQ ID NOs.: 1-93 as determined using FASTA version 3.0t78 with the 
default parameters. 

21 . The polypeptide of Claim 20, wherein said polypeptide has at least 25% identity to 
20 a polypeptide comprising one of SEQ ID NOs: 299-305, 312-315, 327-353, 357-364, 372-458, 464- 

466, 468 and 472-479 or at least 25% identity to a fragment comprising at least 5, at least 10, at 
least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 consecutive amino acids of a 
polypeptide comprising one of SEQ ID NOs.: 299-305, 3 12-315, 327-353, 357-364, 372-458, 464- 
466, 468 and 472-479 as determined using FASTA version 3.0t78 with the default parameters. 
25 22. An antibody capable of specifically binding the polypeptide of one of Claims 18- 

21. 

23. A method of producing a polypeptide, comprising introducing a vector comprising 
a promoter operably linked to a nucleic acid encoding a polypeptide whose expression is inhibited 
by an antisense nucleic acid comprising one of SEQ ID NOs.: 1-93 into a cell and expressing said 

30 polypeptide. 

24. The method of Claim 23, further comprising the step of isolating said polypeptide. 

25. The method of Claim 23, wherein said polypeptide comprises a sequence selected 
from the group consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 464- 
466, 468 and 472-479. 
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26. A method of inhibiting proliferation of a microorganism comprising inhibiting the 
activity or reducing the amount of a gene product whose expression is inhibited by an anti sense 
nucleic acid comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93 or 
inhibiting the activity or reducing the amount of a nucleic acid encoding said gene product 
5 27. The method of Claim 26, wherein said gene product comprises a polypeptide 

comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-305, 312-31 5, 327- 
353, 357-364, 372-458, 464-466, 468 and 472-479. 

28. A method for identifying a compound which influences the activity of a gene 
product required for proliferation, said gene product comprising a gene product whose expression is 

1 0 inhibited by an antisense nucleic acid comprising a sequence selected from the group consisting of 
SEQ ID NOs.: 1-93, said method comprising: 

contacting said gene product with a candidate compound; and 

determining whether said compound influences the activity of said gene product. 

29. The method of Claim 28, wherein said gene product is a polypeptide and said 
1 5 activity is an enzymatic activity. 

30. The method of Claim 28, wherein said gene product is a polypeptide and said 
activity is a carbon compound catabolism activity. 

31. The method of Claim 28, wherein said gene product is a polypeptide and said 
activity is a biosynthetic activity. 

20 32. The method of Claim 28, wherein said gene product is a polypeptide and said 

activity is a transporter activity. 

33. The method of Claim 28, wherein said gene product is a polypeptide and said 
activity is a transcriptional activity. 

34. The method of Claim 28, wherein said gene product is a polypeptide and said 
25 activity is a DNA replication activity. 

35. The method of Claim 28, wherein said gene product is a polypeptide and said 
activity is a cell division activity. 

36. A compound identified using the method of Claim 28. 

37. The method of Claim 28, wherein said gene product is a polypeptide comprising a 
30 sequence selected from the group consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357- 

364, 372-458, 464-466, 468 and 472-479. 



35 
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38. A method for identifying a compound or nucleic acid having the ability to reduce 
the activity or level of a gene product required for proliferation, said gene product comprising a 
gene product whose activity or expression is inhibited by an antisense nucleic acid comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 1-93, said method comprising: 

5 (a) providing a target that is a gene or RNA, wherein said target comprises a 

nucleic acid encoding said gene product; 

(b) contacting said target with a candidate compound or nucleic acid; and 

(c) measuring an activity of said target 

39. The method of Claim 38, wherein said target is a messenger RNA molecule and 
1 0 said activity is translation of said messenger RNA. 

40. The method of Claim 38, wherein said target is a messenger RNA molecule and 
said activity is transcription of a gene encoding said messenger RNA. 

41. The method of Claim 38, wherein said target is a gene and said activity is 
transcription of said gene. 

15 42. The method of Claim 38, wherein said target is a nontranslated RNA and said 

activity is processing or folding of said nontranslated RNA or assembly of said nontranslated RNA 
into a protein/RNA complex. 

43. The method of Claim 38, wherein said target gene or RNA encodes a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-305, 312-315, 327- 

20 353, 357-364, 372-458, 464-466, 468 and 472-479. 

44. A compound or nucleic acid identified using the method of Claim 38. 

45. A method for identifying a compound which reduces the activity or level of a gene 
product required for proliferation of a microorganism, wherein the activity or expression of said - 
gene product is inhibited by an antisense nucleic acid comprising a sequence selected from the 

25 group consisting of SEQ ID NOs.: 1-93, said method comprising the steps of: 

(a) expressing a sub-lethal level of an antisense nucleic acid complementary to 
a nucleic acid encoding said gene product in a cell to reduce the activity or amount of said 
gene product in said cell, thereby producing a sensitized cell; 

(b) contacting said sensitized cell with a compound; and 

30 (c) determining whether said compound inhibits the growth of said sensitized 

cell. 

46. The method of Claim 45, wherein said determining step comprises determining 
whether said compound inhibits the growth of said sensitized cell to a greater extent than said 
compound inhibits the growth of a nonsensitized cell. 

35 47. The method of Claim 45, wherein said cell is selected from the group consisting of 

bacterial cells, fungal cells, plant cells, and animal cells. 

48. The method of Claim 45, wherein said cell is a Gram negative bacterium. 
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49. The method of Claim 45, wherein said cell is an K coli cell. 

50. The method of Claim 45, wherein said cell is from an organism selected from the group 
consisting of Aspergillus Jumigatus, Bacillus anthracis, Burkholderia cepacia, Campylobacter 
jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabratq), Candida tropicalis, 

5 Candida parapsilosis, Candida guilliermondii, Candida krusei t Candida kefyr (also called Candida 
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatus, 
Clostridium botulinum, Clostridium difficile, Cryptococcus neoformans, Enierobacter cloacae, 
Enterococcus faecalis, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Klebsiella 
pneumoniae, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, 

10 Neisseria gonorrhoeae, Pseudomonas aeruginosa, Salmonella cholerasuis, Salmonella enterica, 
Salmonella paratyphi, Salmonella typhi, Salmonella typhimurium, Staphylococcus aureus, 
Klebsiella pneumoniae, Listeria monocytogenes, Moxarella catarrhalis, Shigella boydii, Shigella 
dysenteriae, Shigella flexneri, Shigella sonnei, Pseudomonas aeruginosa, Staphylococcus 
epidermidis, Streptococcus pneumoniae, Treponema pallidum, Yersinia pestis and any species 

1 5 falling within the genera of any of the above species. , 

5 1 . The method of Claim 45, wherein said antisense nucleic acid is transcribed from an 
inducible promoter. 

52. The method of Claim 51, further comprising the step of contacting said cell with a 
concentration of inducer which induces said antisense nucleic acid to a sub-lethal level. 

20 53. The method of Claim 45, wherein growth inhibition is measured by monitoring 

optical density of a culture growth solution. 

54. The method of Claim 45, wherein said gene product is a polypeptide. 

55. The method of Claim 54, wherein said polypeptide comprises a sequence selected 
from the group consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357-364, 372-458, 464- 

25 466, 468 and 472-479. 

56. The method of Claim 45, wherein said gene product is an RNA. 

57. A compound identified using the method of Claim 45. 

58. A method for inhibiting cellular proliferation comprising introducing a compound 
with activity against a gene whose activity or expression is inhibited by an antisense nucleic acid 

30 comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93 or a compound 
with activity against the product of said gene into a population of cells expressing said gene. 

59. The method of Claim 58, wherein said compound is an antisense nucleic acid 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93, or a proliferation- 
inhibiting portion thereof. 

35 60. The method of Claim 59, wherein said proliferation inhibiting portion of one of 

SEQ ID NOs.: 1-93 is a fragment comprising at least 10, at least 20, at least 25, at least 30, at least 
50 or more than 5 1 consecutive nucleotides of one of SEQ ID NOs.: 1-93. 
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61 . The method of Claim 58, wherein said population is a population selected from the 
group consisting of bacterial cells, fungal cells, plant cells, and animal cells. 

62. The method of Claim 58, wherein said population is a population of Gram negative 
bacteria. 

5 63. The method of Claim 58, wherein said population is a population of R coli cells. 

64. The method of Claim 58, wherein said population is a population selected from the 
group consisting of Aspergillus fumigatus, Bacillus anthracis, Burkholderia cepacia, 
Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), 
Candida tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kejyr 

10 (also called Candida pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia 
trachomatis, Clostridium botulinwn, Clostridium difficile, Cryptococcus neoformans, 
Enterobacter cloacae, Enterococcus faecalis, Escherichia coli, Haemophilus influenzae, 
Helicobacter pylori, Klebsiella pneumoniae,Listeria monocytogenes, Mycobacterium leprae, 
Mycobacterium tuberculosis, Neisseria gonorrhoeae, Pseudomonas aeruginosa, Salmonella 

15 cholerasuis, Salmonella enterica, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurhan, Staphylococcus aureus, Klebsiella pneumoniae, Listeria monocytogenes, Moxarella 
catarrhalis, Shigella boydii, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, Pseudomonas 
aeruginosa, Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema pallidum, 
Yersinia pestis and any species falling within the genera of any of the above species. 

20 65. . The method of Claim 58, wherein said gene encodes a polypeptide comprising a 

sequence selected from the group consisting of SEQ ID NOs.: 299-305, 312-315, 327-353, 357- 
364, 372-458, 464-466, 468 and 472-479. 

66. A preparation comprising an effective concentration of an antisense nucleic acid 
comprising a sequence selected from the group consisting of SEQ ED NOs.: 1-93, or a proliferation- 

25 . inhibiting portion thereof in a pharmaceutically acceptable carrier. 

67. The preparation of Claim 66, wherein said proliferation-inhibiting portion of one of 
SEQ ID NOs.: 1-93 comprises at least 10, at least 20, at least 25, at least 30, at least 50 or more than 
50 consecutive nucleotides ofoneofSEQ ID NOs.: 1-93. 

68. A method for inhibiting the activity or expression of a gene in an operon required 
30 for proliferation wherein the activity or expression of at least one gene in said operon is inhibited by 

an antisense nucleic acid comprising a sequence selected from the group consisting of SEQ ID 
NOs.: 1-93, said method comprising contacting a cell in a cell population with an antisense nucleic 
acid comprising at least a proliferation-inhibiting portion of said operon. 

69. The method of Claim 68, wherein said antisense nucleic acid comprises a sequence 
35 selected from the group consisting of SEQ ID NOs.: 1-93 or a proliferation inhibiting portion 

thereof. 
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70. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
acid by introducing a plasmid which expresses said antisense nucleic acid into said cell population. 

71. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
acid by introducing a phage which expresses said antisense nucleic acid into said cell population. 

5 72. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 

acid by expressing said antisense nucleic acid from the chromosome of cells in said cell population. 
73. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
• acid by introducing a promoter adjacent to a chromosomal copy of said antisense nucleic acid such 
that said promoter directs the synthesis of said antisense nucleic acid. 
10 74. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 

acid by introducing a retron which expresses said antisense nucleic acid into said cell population. 

75. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
acid by introducing a ribozyme into said cell-population, wherein a binding portion of said 
ribozyme is complementary to said antisense oligonucleotide. 
15 76. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 

acid by introducing a liposome comprising said antisense oligonucleotide into said cell. 

77. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
acid by electroporation of said antisense nucleic acid. 

78. The method of Claim 68, wherein said antisense nucleic acid is a fragment 
20 comprising at least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive 

nucleotides of one of SEQ ID NOs.: 1-93. 

79. The method of Claim 68 wherein said antisense nucleic acid is an oligonucleotide. 

80. A method for identifying a gene which is required for proliferation of a 
microorganism comprising: 

25 (a) contacting a microorganism other than E. coli with a nucleic acid selected 

from the group consisting of SEQ ID NOs.: 1-93; 

(b) determining whether said nucleic acid inhibits proliferation of said 
microorganism; and 

(c) identifying the gene in said microorganism which is inhibited by said 
30 nucleic acid. 

81. Hie method of Claim 80, wherein said microorganism is a Gram negative 
bacterium. 

82. The method of Claim 80 wherein said microorganism is selected from the group 
consisting of Aspergillus fumigatus, Bacillus anthracis, Burkholderia cepacia, Campylobacter 

35 jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropiccdis, 
Candida parapsilosis, Candida guilliermondii, Candida kmsei, Candida kejyr (also called Candida 
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatus, 
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Clostridium botulmum, Clostridium difficile, Cryptococcus neoformans, Enterobacter cloacae, 
Enterococcus faecalis, Escherichia coll, Haemophilus influenzae, Helicobacter pylori, Klebsiella 
pneumoniae, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, 
Neisseria gonorrhoeae, Pseudomonas aeruginosa, Salmonella cholerasuis, Salmonella enterica, 
5 Salmonella paratyphi, Salmonella typhi, Salmonella typhimurium. Staphylococcus aureus, 
Klebsiella pneumoniae, Listeria monocytogenes, Moxarella catarrhalis, Shigella boydii, Shigella 
dysenteriae, Shigella flexneri, Shigella sonnei, Pseudomonas aeruginosa, Staphylococcus 
epidermidis, Streptococcus pneumoniae, Treponema pallidum, Yersinia pestis and any species 
falling within the genera of any of the above species. 
10 83. The method of Claim 80, further comprising introducing said nucleic acid into a 

vector functional in said microorganism prior to introducing said inhibitory nucleic acid into said 
microorganism. 

84. A method for identifying a compound having the ability to inhibit proliferation of a 
microorganism comprising: 

15 (a) identifying in a first microorganism a homolog of a gene or gene product 

present in a second microorganism which is different than said first microorganism, 
wherein the activity or level of said gene or gene product is inhibited by a nucleic acid 
comprising a sequence selected from the group consisting of SEQ ID NOs. 1-93 ; 

(b) identifying an inhibitory nucleic acid sequence which inhibits the activity 
20 of said homolog in said first microorganism; 

(c) contacting said first microorganism with a sub-lethal level of said 
inhibitory nucleic acid, thus sensitizing said first microorganism; 

(d) contacting the sensitized microorganism of step (c) with a compound; and 

(e) determining whether said compound inhibits proliferation of said sensitized 
25 microorganism. 

85. The method of Claim 84, wherein said determining step comprises determining 
whether said compound inhibits proliferation of said sensitized microorganism to a greater extent 
than said compound inhibits proliferation of a nonsensitized microorganism. 

86. The method of Claim 84 wherein step (a) comprises identifying a homologous 
30 nucleic acid to a gene or gene product whose activity or level is inhibited by a nucleic acid selected 

from the group consisting of SEQ ID NOs. 1-93 or a nucleic acid encoding a homologous 
polypeptide to a polypeptide whose activity or level is inhibited by a nucleic acid selected from the 
group consisting of SEQ ID NOs. 1-93 by using an algorithm selected from the group consisting of 
BLASTN version 2.0 with the default parameters and FASTA version 3.0t78 algorithm with the 
35 default parameters to identify said homologous nucleic acid or said nucleic acid encoding a 
homologous polypeptide in a database. 



-99- 



WO 01/48209 PCTAJSOO/34419 

87. The method of Claim 84 wherein said step (a) comprises identifying a homologous 
nucleic acid or a nucleic acid encoding a homologous polypeptide by identifying nucleic acids 
which hybridize to said first gene. 

88. The method of Claim 84 wherein the step (a) comprises expressing a nucleic acid 
5 selected from the group consisting of SEQ ID NOs. 1-93 in said microorganism. 

89. The method of Claim 84, wherein said inhibitory nucleic acid is an antisense 
nucleic acid. 

90. The method of Claim 84, wherein said inhibitoiy nucleic acid comprises an 
antisense nucleic acid to a portion of said homolog. 

10 91. The method of Claim 84, wherein said inhibitory nucleic acid comprises an 

antisense nucleic acid to a portion of the operon encoding said homolog. 

92. The method of Claim 84, wherein the step of contacting the first microorganism 

with a sub-lethal level of said inhibitory nucleic acid comprises directly contacting said 

microorganism with said inhibitory nucleic acid. 
15 93. The method of Claim 84, wherein the step of contacting the first microorganism 

with a sub-lethal level of said inhibitory nucleic acid comprises expressing an antisense nucleic 

acid to said homolog in said microorganism. 

94. The method of Claim 84, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-305, 3 12-315, 327- 

20 353, 357-364, 372-458, 464-466, 468 and 472-479. 

95. A compound identified using the method of Claim 84. 

96. A method of identifying a compound having the ability to inhibit proliferation 
comprising: 

(a) contacting a microorganism other than K coli with a sub-lethal level of a 
25 nucleic acid comprising a sequence selected from the group consisting of SEQ ID NOs. 1- 

93 or a portion thereof which inhibits the proliferation of E. coli, thus sensitizing said 
microorganism; 

(b) contacting the sensitized microorganism of step (a) with a compound; and 

(c) determining whether said compound inhibits proliferation of said sensitized 
30 microorganism. 

97. The method of Claim 96, wherein said determining step comprises determining 
whether said compound inhibits proliferation of said sensitized microorganism to a greater extent 
than said compound inhibits proliferation of a nonsensitized microorganism. 

98. A compound identified using the method of Claim 96. 

35 
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99. A method for identifying a compound having activity against a biological pathway 
required for proliferation comprising: 

(a) sensitizing a cell by expressing a sub-lethal level of an antisense nucleic 
acid complementary to a nucleic acid encoding a gene product required for proliferation, 

5 wherein the activity or expression of said gene product is inhibited by an antisense nucleic 

acid comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93, in 
said cell to reduce die activity or amount of said gene product; 

(b) contacting the sensitized cell with a compound;.and 

(c) determining whether said compound inhibits die growth of said sensitized 

10 cell. 

100. The method of Claim 99, wherein said determining step comprises determining 
whether said compound inhibits the growth of said sensitized cell to a greater extent than said 
compound inhibits the growth of a nonsensitized cell. 

101. The method of Claim 99, wherein said cell is selected from the group consisting of 
1 5 bacterial cells, fungal cells, plant cells, and animal cells. 

1 02. The method of Claim 99, wherein said cell is a Gram negative bacterium. 

1 03 . the method of Claim 99, wherein said Gram negative bacterium is E. coli. 

104. The method of Claim 99, wherein said cell is selected from the group consisting of 
Aspergillus fumigatus, Bacillus anthracis, Burkholderia cepacia, Campylobacter jejuni, Candida 

20 albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida 
parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida 
pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatus, 
Clostridium botulinum, Clostridium difficile, Cryptococcus neoformans, Enterobacter cloacae, 
Enterococcus faecalis, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Klebsiella 

25 pneumoniae, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, 
Neisseria gonorrhoeae, Pseudomonas aeruginosa, Salmonella cholerasuis, Salmonella enterica, 
Salmonella paratyphi, Salmonella typhi, Salmonella typhimurium, Staphylococcus aureus, 
Klebsiella pneumoniae, Listeria monocytogenes, Moxarella catarrhalis, Shigella boydii, Shigella 
dysenteriae, Shigella flexneri, Shigella sonnei, Pseudomonas aeruginosa, Staphylococcus 

30 epidermidis, Streptococcus pneumoniae, Treponema pallidum, Yersinia pestis and any species 
felling within the genera of any of the above species. 

105. The method of Claim 99, wherein said antisense nucleic acid is transcribed from an 
inducible promoter. 

106. The method of Claim 99, further comprising contacting the cell with an agent 
35 which induces expression of said antisense nucleic acid from said inducible promoter, wherein said 

antisense nucleic acid is expressed at a sub-lethal level. 
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107. The method of Claim 99, wherein inhibition of proliferation is measured by 
monitoring the optical density of a liquid culture. 

108. The method of Claim 99, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-305, 312-315, 327- 

5 353, 357-364, 372-458, 464-466, 468 and 472-479 

1 09. A compound identified using the method of Claim 99. 

110. A method for identifying a compound having the ability to inhibit cellular 
proliferation comprising: 

(a) contacting a cell with an agent which reduces the activity or level of a gene 
10 product required for proliferation of said cell, wherein said gene product is a gene product 

whose activity or expression is inhibited by an antisense nucleic acid comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 1-93; 

(b) contacting said cell with a compound; and 

(c) determining whether said compound reduces proliferation of said contacted 

15 cell. 

111. The method of Claim 110, wherein said determining step comprises determining 
whether said compound reduces proliferation of said contacted cell to a greater extent than said 
compound reduces proliferation of cells which have not been contacted with said agent. 

1 12. The method of Claim 110, wherein said agent which reduces the activity or level of 
20 a gene product required for proliferation of said cell comprises an antisense nucleic acid, to a gene 

or operon required for proliferation. 

113. The method of Claim 110, wherein said agent which reduces the activity or level of 
a gene product required for proliferation of said cell comprises a compound known to inhibit 
growth or proliferation of a microorganism. 

25 1 14. The method of Claim 1 10, wherein said cell contains a mutation which reduces the 

activity or level of said gene product required for proliferation of said cell. 

115. The method of Claim 114, wherein said mutation is a temperature sensitive 
mutation. 

116. The method of Claim 110, wherein said gene product comprises a polypeptide 
30 comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-305, 312-315, 327- 

353, 357-364, 372-458, 464-466, 468 and 472-479 

117. A compound identified using the method of Claim 110. 



35 
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118. A method for identifying the biological pathway in which a proliferation-required 
gene or its gene product lies, wherein said gene or gene product comprises a gene or gene product 
whose activity or expression is inhibited by an antisense nucleic acid comprising a sequence 
selected from the group consisting of SEQ IDNOs.: 1-93, said method comprising: 

5 (a) expressing a sub-lethal level of an antisense nucleic acid which inhibits the 

activity of said proliferation-required gene or gene product in a cell; 

(b) contacting said cell with a compound known to inhibit growth or 
proliferation of a microorganism, wherein the biological pathway on which said compound 
acts is known; and 

1 o (c) determining whether said cell is sensitive to said compound. 

119. The method of Claim 118, wherein said determining step comprises determining 
whether said cell has a substantially greater sensitivity to said compound than a cell which does not 
express said sub-lethal level of said antisense nucleic acid and wherein said gene or gene product 
lies in the same pathway on which said compound acts if said cell expressing said sub-lethal level 

15 of said antisense nucleic acid has a substantially greater sensitivity to said compound than said cell 
which does not express said sub-lethal level of said antisense nucleic acid. 

120. The method of Claim 118, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-305, 3 12-3 1 5, 327- 
353, 357-364, 372-458, 464-466, 468 and 472-479 

20 121. A method for determining the biological pathway on which a test compound acts 

comprising: 

(a) expressing a sub-lethal level of an antisense nucleic acid complementary to 
a proliferation-required nucleic acid in a cell, wherein the activity or expression of said 
proliferation-required nucleic acid is inhibited by an antisense nucleic acid comprising a 

25 sequence selected from the group consisting of SEQ ID NOs.: 1-93 and wherein the 

biological pathway in which said proliferation-required nucleic acid or a protein encoded 
by said proliferation-required polypeptide lies is known, 

(b) contacting said cell with said test compound; and 

(c) determining whether said cell is sensitive to said test compound. 

30 122. The method of Claim 121, wherein said determining step comprises determining 

whether said cell has a substantially greater sensitivity to said test compound than a cell which does 
not express said sub-lethal level of said antisense nucleic acid. 

35 
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123. The method of Claim 121, further comprising: 

(d) expressing a sub-lethal level of a second antisense nucleic acid 
complementary to a second proliferation-required nucleic acid in a second cell, wherein 
said second proliferation-required nucleic acid is in a different biological pathway than said 

5 proliferation-required nucleic acid in step (a); and 

(e) determining whether said second cell does not have a substantially greater 
sensitivity to said test compound than a cell which does not express said sub-lethal level of 
said second antisense nucleic acid, wherein said test compound is specific for the biological 
pathway against which the antisense nucleic acid of step (a) acts if said second cell does not 

1 0 have substantially greater sensitivity to said test compound. 

124. A purified or isolated nucleic acid comprising a sequence selected from the group 
consisting of SEQ ID NOs.: 1-93. 

125. A compound which interacts with a gene or gene product whose activity or 
expression is inhibited by an antisense nucleic acid comprising one of SEQ ID NOs.: 1-93 to inhibit 

15 proliferation. 

126. A compound which interacts with a polypeptide whose expression is inhibited by 
an antisense nucleic acid comprising one of SEQ ID NOs.: 1-93 to inhibit proliferation. 

127. A method for manufacturing an antibiotic comprising the steps of: 

screening one or more candidate compounds to identify a compound that reduces 
20 the activity or level of a gene product required for proliferation, said gene product 

comprising a gene product whose activity or expression is inhibited by an antisense nucleic 
acid comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93; and 

manufacturing the compound so identified. 

128. The method of Claim 127, wherein said screening step comprises performing any 
25 one of the methods of Claims 28, 38, 45, 96, 99 and 1 10. 

129. A method for inhibiting proliferation of a microorganism in a subject comprising 
administering a compound that reduces the activity or level of a gene product required for 
proliferation of said microorganism, said gene product comprising a gene product whose activity or 
expression is inhibited by an antisense nucleic acid comprising a sequence selected from the group 

30 consisting of SEQ ID NOs.: 1-93 to said subject 

130. The method of Claim 129 wherein said subject is selected from the group 
consisting of vertebrates, mammals, avians, and human beings. 

131. The method of Claim 129, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-305, 312-315, 327- 

35 353, 357-364, 372-458, 464-466, 468 and 472-479. 
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SEQUENCE LISTING 

<110> ELITRA PHARMACEUTICALS, INC. 
FORSYTH, R. Allyn 
OHLSEN, Kari L. 
ZYSKIND, Judith W. 

<120> GENES IDENTIFIED AS REQUIRED FOR 
. PROLIFERATION OF E;.. COLI 

<130> ELITRA. 009VPC 

<150> US 60/173,005 
<151> 1999-12-23 

<160> 481 ' 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 142 
<212> DNA 

<213> Escherichia coli 
<4Q0> 1 

cgctccttag agtacttctg gtgccagaga gataattttc atgaacttct cactacgaag 60* 
ctcacgagtt accggcccaa aaatacgcgt accgataggc tgctcgctgt tgttgttcag 120 
aagaacacaa gcattaccat eg 142 

<210> 2 
<211> 154 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
•<222> (1) ... (154) 
<223> n ■« A,T,C or-G . 

<400> 2 

gctccttaga gtacttntgg tgeeggagag ataattttca tgaacttctc actacnaagc 60 
tcacgagtta ccggcccaaa aatacgcgta ccgataggct gctcgctgtt gttgttcacg 120 
aagaacacaa gcattaccat egaagegaat gaca 154 

<210> 3 
<211> 140' 
<212> DNA 

<213> Escherichia coli 
<400> 3 

tacttctggt gecagagaga taattttcat gaacttctca ctacgaagct cacgagttac 60 
cggcccaaaa atacgegtae cgataggctg ctcgctgttg ttgttcagaa gaacacaagc 120 
attaccatcg aagcgaatga 140 

<210>" 4 
<211> 177 
<212> DNA 

<213> Escherichia coli 
<400> 4 

. cgctccttag agtacttctg gtgccagaga gataattttc atgaacttct cactacgaag 60 
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ctcacgagtt accggcccaa aaatacgcgt accgataggc tgctcgctgt tgttgttcag 120 
aagaacacaa gcattaccat cgaagcgaat gacagaaccg tccgggcgac gaacacc 177 

<210> 5 
<211> 255 
<212> DNA 

<213> Escherichia coli 



<400> 5 

ctctttttct tcgagatgga cgcagtgagc 
ctgatggtaa acatccagat aaccatcatg 
ttcatctttg ttttcacaga gatgggtatg 
gcgttgcgcc atcgccatct gttcaggaga 
atatagcaga cgacc 

<210> 6 
<211> 395 
<212> DNA. 

<213> Escherichia coli 



aaagacacag tttttaccgg tcaggccgta 60 
gtcaggataa agcgatttca cccaggcaat 120 
tacccacgta tccggatatt cttctttcag 180 
tgaggtcggg gcgaagcgtg gcgtaatcgc 240 

255 



<400> 6 

tccgcgatcg cggataaaaa ggaggattta 
gcaatattaa gccgcatgcc atctcgacat 
actcccagca tgccctgagt atcgctatag 
ccccacaggc taaggttgtt atttactttc 
ttacgcgcac cttcacggcc tacggtttga 
ttgttgatcc agttcgcttc aatgtaaggc 
tggtgatggc tgttcaggta ggttttcacc 

<210> 7 
<211> 280 
<212> DNA 

<213> Escherichia coli 



aataaggcgt tgatgcgccg tttgaagtcg 60 
gcggcttata cggtttacca gctatatttc 120 
cctttatcac ctagttgcac accgacattc 180 
gcctcaaccc cggtacgtac ttcaccgaga 240 
ccattcatct tcacggcgta gactttgcta 300 
tggaactcac gctgtttacc atcgtcacgc 360 
ccaag 395 



<400> 7 

tgatcgatca ggtaatttgc cagcggtaga 
cacatttccc gcccaagcag aaacgcggtg 
ggattaacca gctgcgaggc cgctttagcg 
tgatgcagat tcatcggcgt ccagcgcggg 
cgcacttttt tcagtttgcc gctttcgtca 



taattagtga tgatttgcac tggcttgcca 60 
gagccgcagt tgatgactac gctttcgccc 120 
atacgtactt tttcatcgtg attctgcgcc 180 
cgctgttggg taatagcttc tgcgccattg 240 
agtttattga 280 



<210> 8 
<211> 363 
<212> DNA 

<213> Escherichia coli 



<400> 8 

agtaaaggat atacaaattt aaaggaagtt 
gtttcaattc cgataacagc cagttactac 
gcactggagg caaccgcatt aatcaacgtg 
agtggctaat aataaaaaat aacaaggtgt 
tttttagcga tgataaaaaa tctcaccgtc 
gctgatcaac acgtacgcgc cacccatata 
cag 



gcggcgaagt caaaacttac agatacaacg 60 
gtctacgata caaacaaagt taaatctggc 120 
aaatacgact aataacaagc aagacgagca 180 
ctgcatttcc ccctcatgat gagggggctt 240 
gtaggcttta atgattttac cgtcggtgtc 300 
ggtccagtgc gtcccggcat caggcgcggg 360 

363 



<210> 9 
<211> 295 
<212> DNA 

<213> Escherichia coli 



<400> 9 
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ttacagatac aacggtttca attccgataa 
aagttaaatc tggcgcactg gaggcaaccg 
aagcaagacg agcaagtggc taataataaa 
tgatgagggg gcttttttta gcgatgataa 
ttaccgtcgg tgtcgctgat caacacgtac 

<210> 10 
<211> 161 
<212> DNA 

<213> Escherichia coli 



cagccagtta ctacgtctac gatacaaaca 60 
cattaatcaa cgtgaaatac gactaataac 120 
aaataacaag gtgtctgcat ttccccctca 180 
aaaatctcac cgtcgtaggc tttaatgatt 240 
gcgccaccca tataggtcca gtgcg 295 



<400> 10 

acgactaata acaagcaaga cgagcaagtg gctaataata aaaaataaca aggtgtctgc 60 
atttccccct catgatgagg gggctttttt tagcgatgat aaaaaatctc accgtcgtag 120 
gctttaatga ttttaccgtc ggtgtcgctg atcaacacgt a 161 

<210> 11 
<211> 290 
<212> DNA 

<213> Escherichia coli 



<400> 11 

tgcggcgaag tcaaaactta cagatacaac 
cgtctacgat acaaacaaag ttaaatctgg 
gaaatacgac taataacaag caagacgagc 
tctgcatttc cccctcatga tgagggggct 
cgtaggcttt aatgatttta ccgtcggtgt 

<210> 12 
<211> 137 
<212> DNA 

<213> Escherichia coli 



ggtttcaatt ccgataacag ccagttacta 60 
cgcactggag gcaaccgcat taatcaacgt 120 
aagtggctaa taataaaaaa taacaaggtg 180 
ttttttagcg atgataaaaa atctcaccgt 240 
cgctgatcaa cacgtacgcg 290 



<400> 12 

ctggttttag cttttgcttc gtaaacacgt aataaaacgt cctcacacaa tatgaggacg 60 
ccgaatttta gggcgatgcc gaaaaggtgt caagaaatat acaacgatcc cgccatcacc 120 
aggccatctg gctgggg 137 

<210> 13 
<211> 275 
<212> DNA 

<213> Escherichia coli 



<400> 13 

cgtcatgtac gtgcagtttg gtcgtacgct 
caaaacgttc gatagcaaca acaatggatt 
gcagagtacg gattttatcg gtcattacgc 
gcgtgcgaca tcgcgacgca cttgcttcaa 
agcctgcata cgcaggttga actgctcacg 

<210> 14 
<211> 239 
<212> DNA 

<213> Escherichia coli 



tgatgaattt accgtagatc gggtgtttca 60 
tctccatttt gtcgctaaca acgcgacctt 120 
acccgccttc tcgttcagta aagtcttaac 180 
caggtgagac tgttgcagct ggccacttgc 240 
cagca 275 



<400> 14 

tgtcggtaaa caatgacaat ttgagacaga 
ataatggtta tttaaataaa ccacatgaat 
ttactcacaa tgtatcaggg aaatataact 
aaatcgaaat ttaaaatatt gtgccggagg 



gtgaaaggtc agatttgcgg agtaatgcac 60 
cattaatggg ttattcatta tttttgtgat 120 
taccgggaga tgtaatatgt ttaatttttc 180 
catctctggc acattgggca attacggca 239 



-3- 



WO 01/48209 



PCT/US00/34419 



<210> 15 
<211> 114 
<212> DNA 

<213> Escherichia coli 



<400> 15 

cgatgttctg aatctgctcg cgcatctgct caatcaacac tttcagctcg atggcggagt 60 
ttgtcacttc ggcattgata gacttcgacg caagagtgtt cgactcgcgg ttga 114 

<210> 16 
<211> 290 
<212> DNA 

<213> Escherichia coli 



<400> 16 

acgggaagta attgtaactt agtttgtaaa 
gctgacatac tgatgcaaaa tcttttagat 
agaatttata ataaatttct gcgtccgtta 
atgcatatat tctatcaagg atatcaggag 
ctttcgataa tgcgtgtttt agttgatttg 

<210> 17 
<211> 444 
<212> DNA 

<213> Escherichia coli 



ggtcatatgc ttgttgatat ttctttttgg 60 
actcatgatt ttcgaacgca taaatgcaca 120 
tattaccctt ttcatggtag tcgaatgcat 180 
ttaaaccatg tagtaatgcc agattttctc 240 
cccattcatc actattattg 290 



<220> 

<221> misc_feature 
<222> (1) . . . (444) 
<223> n = A,T,C or G 



<400> 17 

acgggaagta attgtaactt agtttgtaaa 
gctgacatac tgatgcaaaa tcttttagat 
agaatttata ataaatttct gcgtccgtta 
atgcatatat tctatcaagg atatcaggag 
ctttcgataa tgcgtgtttt agttgatttg 
ttgtttctgt gctcataata ttcctctgng 
antgactcat ngnnattntt atatnnatcc 
tgcttatggn ntantntnat ttat 



ggtcatatgc ttgttgatat ttctttttgg 60 
actcatgatt ttcgaacgca taaatgcaca 120 
tattaccctt ttcatggtag tcgaatgcat 180 
ttaaaccatg tagtaatgcc agattttctc 240 
cccattcatc actattantg aatatttcaa 300 
atgtgnctcg ngatgccnaa caattatttg 360 
nntimtaant natnantntn aagccngcnt 420 

444 



<210> 18 
<211> 250 
<212> DNA 

<213> Escherichia coli 



<400> 18 

cgctggaaaa aacctaaaat attcatctta 
ttcaagccgt atttttattt tgctgcaaat 
tagatcatct gctacagagt gtgtgaaaat 
ttttgtgagt tttgtcacca aatatcgtta 
aaacttattt 



ttccccctac gagaacccta tttggctcgt 60 
tgtactgccg atgttctgta atcagattgt 120 
ttaattcgta tcgcaaatta aacgcgtgtc 180 
ttatcactcc cttttactgg ctaaaccaga 240 

250 



<210> 19 
<211> 237 
<212> DNA 

<213> Escherichia coli 
<400> 19 

gtattacaac aagaatacag tgctaaaaaa agaatgtacc tgaagatagt cttcattttt 60 
actccaatgt ttctttattg ttatggtttg cgctgttttt gtttgttatt cgtagtaaat 120 
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atccactaac acatggctca taagttgacc agatgtaaca ctcaatgcag gatcgatttt 180 
ttgcattcgg gtgctaaatt tccaggatga accattcatg tcatccgaca ttactgg 237 

<210> 20 
<211> 237 
<212> DNA 

<213> Escherichia coli 



<400> 20 

catcatcaca gcggcagcca tgtacagcag 
cagtttgttt cagttaaaac gtagtagtgt 
acagatagac gatcgtcaga accatgaaga 
ggaaaatggc ccacggcaca ttcaggatcc 

<210> 21 
<211> 139 
<212> DNA 

<213> Escherichia coli 



atccatattc aggttttcca tgacagtctc 60 
tggtaaatta atgttcttca gacgccatcg 120 
tgaaggcttg cagcgtaatg atcaggatgt 180 
actgtgacca ccacggcaac agaccag 237 



<400> 21 

cccgcgacgt tatccgcgtg taggcatcgt catgacgtaa acggcgatcg gcggtataac 60 
ggttgatcat cgaatccatg ttgccagcag taacaccgaa aaacagattc ggtttaccca 120 
gacgcataaa gtcgtcttt 139 

<210> 22 
<211> 211 
<212> DNA 

<213> Escherichia coli 



<400> 22 

ggagagcgaa gggctatccg gccagggtga 
cgacgagtaa agtactcaaa cggcgcgctc 
agcaggaaga tggtcattcc caacaccaga 
agcatctctg acgatccacc acggaaagtg 



aattatcgcc gcgaacgcac aatttgatat 60 
cacacatgca ccagtcgggt gaacgggaag 120 
tgcaggcgga agacgaacgc tacgccgttg 180 
a 211 



<210> 23 
<211> 154 
<212> DNA 

<213> Escherichia coli 
<400> 23 

cacgacggat tttcgctgcc atgattcgct ccttagagta cttctggtgc cagagagata 60 
attttcatga acttctcact acgaagctca cgagttaccc ggcccaaaaa tacgcgtacc 120 
gataggctgc tcgctgttgt tgttcagaag aaca 154 

<210> 24 
<211> 160 
<212> DNA 

<213> Escherichia coli 



<400> 24 

acttcgtcat cacgacggat tttcgctgcc atgattcgct ccttagagta cttctggtgc 60 
cagagagata attttcatga acttctcact acgaagctca cgagttaccg gcccaaaaat 120 
acgcgtaccg ataggctgct cgctgttgtt gttcagaaga 160 

<210> 25 
<211> 244 
<212> DNA 

<213> Escherichia coli 



<400> 25 
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aatgaccttg ccggaagaca ggacattctt 
ggttaacacg ataacttcgt catcacgacg 
gtacttctgg tgccagagag ataattttca 
ccggcccaaa aatacgcgta ccgataggct 
catt 



aactttaccg cgtttacctt tatctttacc 60 
gattttcgct gccatgattc gctccttaga 120 
tgaacttctc actacgaagc tcacgagtta 180 
gctcgctgtt gttgttcaga agaacacaag 240 

244 



<210> 26 
<211> 108 
<212> DNA 

<213> Escherichia coli 
<400> 26 

tgttcgatct cgaaacttac acctttctca gccagcacaa tgcggacctg atggctatag 60 
atgtcagtag gaccggaaaa cagcgtcatt acccgaacgt ttgttggc 108 

<210> 27 
<211> 369 
<212> DNA 

<213> Escherichia coli 
<220> 

<22l> misc_feature 
<222> (1) . . - (369) 
<223> n = A,T,C or G 

<400> 27 

tccagacgct ctttcttggc gtcgtagtca aagatacccc ctaagaacgt cggagcgttc 60 
cgtgagtcct gaatgcgatt atttaccgga ttaatttcaa acatggtctg atttctttta 120 
ttgagctagt caaaatgcgg tgataagagc gggattgtac ccaatccacg ctctttttta 180 
tagagaagat gacgctaaat tggccagata ttgtcgatga taatttgcag gctgcggttc 240 
cacgaaactc gttgatatcg agcttataag ccagttgcac ttngcgcacc cgttatccgg 300 
ccagagggcg gtatcgacat taaaagcaat accatccagc agtggaccgc cgcccgaccg 360 
gttcgacca 369 

<210> 28 
<211> 457 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_f eature 
<222> (1) ... . (457) 
<223> n sa A,T / C or G 

<400> 28 

accccctaag aacgtcggag cgttccgtga ggtcctgaat gcgattattt accggattaa 60 
tttcaaacat ggtctgattt cttttattga gctagtcaaa atgcggtgat aagagcggga 120 
ttgtacccaa tccacgctct tttttataga gaagatgacg ctaaattggc cagatattgt 180 
cgatgataat ttgcaggctg cggttgccgc gaaactcgtt gatatcgagc ttataagcca 240 
gttgcacttc gcgcacgccg ttatccggcc agagggcggt atcgacatta aaagcaatac 300 
catccagcag tggaccgccg ccgaccggtt cgaccatnac cttcaaatga cgttcgccca 360 
ccagccgctg ttgcancaga cggaaatgac cgtcaaacag cggctccggg aacatctgcc 420 
cccacgggcc agcatcgcgc agcagctgcg ccacttc 457 

<210> 29 
<211> 729 
<212> DNA 

<213> Escherichia coli 
<220> 

<22l> misc feature 
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<222> (1) . .. (729) 
<223> n = A,T,C or G 



<400> 29 

ctttttacgg acaaacaatc atgcaaccag 
taatttgttc gagctttatt gaacacattg 
actaaacctg cagccaagga tgtatagtga 
agataagatt gatcttcgtt gatagaattt 
ttttggtggg agaatgataa gatcttacgt 
ggagtcgaac ctacgacctt cgcattacga 
gccctgaaag gacatgttca cgaacgtgaa 
cgatgcgtca atggccttgt gaatcaaatg 
gcacgaatgg tgtaatcacc aatgccgatc 
cattgggcca cgcgccgtgg agtttttgtg 
actggccgcc gtcggtaaaa cgcgtagagg 
gtnacaaaac gcttggcgtt ggcatatcgc 
cccgggttn 

<210> 30 
<211> 305 
<212> DNA 

<213> Escherichia coli 



aagttgaagt attaaccgat cataatgagc 60 
ccaacacatt aaatttagga gtggtgtaca 120 
gcgaagccct atcaggcctt tttggtcagt 180 
acttacacca gctgttacat taagataatt 240 
aacaatttga ttttaatggt gccgataata 300 
atgcgctgct ctaccaactg agctatatcg 360 
tcacggtgga caaggttaaa actaaccggg 420 
gctacttttg catcacccgg ttttatttac 480 
cacttgtaag tggtcaagtg cttncagccc 540 
tgcttaccgg cactttcgca cccagaccaa 600 
cgttacgtaa acagcggacg aatccacttc 660 
gggtcangat cgcaatcgga gtgttgtgtg 720 

729 



<400> 30 

agccattgtc cggagaattt tcgtctttct 
agctagagtc ttctatgttt aaatccatag 
cattttcttt ggtattttca taatcttcaa 
tctgatgaga attctttcat ctctgttaat 
gatgagcttt tatatatttc ttttgttaat 
cttgg 

<210> 31 
<211> 425 
<212> UNA 

<213> Escherichia coli 



cgagggtatt atctgaaagt aactcttctg 60 
tataattaat ttgggggacg accagataag 120 
tgattatatc cgatgtaatc atatattttt 180 
ttaattctgt tcaagcaatt gatgattttc 240 
tcattttcta atttatctgt aagagttgcc 300 

305 



<400> 31 

aggttactgg cgaggttatc gagaaacgcc 
taaaccatcc agcttaatag ctgattaata 
atgaggtaac ccacaatcag gggaaccaga 
cagtccttgc aaagatgaag tcgaaatgcg 
agtaaagttg caattaaaag gaaatgttat 
tttaaagata ttcttgcgct ttaattacaa 
ccgaggcgtt ccctgaaggc cggggcagcc 
caggg 

<210> 32 
<211> 381 
<212> DNA 

<213> Escherichia coli 



agactgatac ccataaaaaa gagaataagg 60 
acttttaacg cagcttgttg gcgaagcgga 120 
atgattaaca gcccagaaaa catgaaaacc 180 
cgatgacaca ctactgaaag cggaaggacg 240 
gcataaggag cagtagagta ttcgttttca 300 
actgcaccga tgttggtggc gtcaaaatcg 360 
cacatggatg tgggctgagg gcgcgtttta 420 

425 



<400> 32 

aaaagagaat aaggtaaacc atccagctta 
gttggcgaag cggaatgagg taacccacaa 
aaaacatgaa aacccagtcc ttgcaaagat 
aaagcggaag gacgagtaaa gttgcaatta 
agtattcgtt ttcatttaaa gatattcttg 
tggcgtcaaa atcgccgagg cgttccctga 
gagggcgcgt tttacaggga t 



atagctgatt aataactttt aacgcagctt 60 
tcaggggaac cagaatgatt aacagcccag 120 
gaagtcgaaa tgcgcgatga cacactactg 180 
aaaggaaatg ttatgcataa ggagcagtag 240 
cgctttaatt acaaactgca ccgatgttgg 300 
aggccggggc agcccacatg gatgtgggct 360 

381 



<210> 33 
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<211> 329 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc_feature 
<222> (1) . . . (329) 
<223> n = A,T,C or G 



<400> 33 

gtcgatgttg tgcccgaact tgccatgctc 
gaggtgctta aacgaaaaac cccgcccggt 
tgttccagtc cctacggcgc attgccgacg 
gcggatgggc gcagttggta gtagttttgc 
tgttggtttt gtgtttaaca atatttatac 
aatttttcag caagggaaaa ttgaggggn 

<210> 34 
<211> 442 
<212> DNA 

<213> Escherichia coli 



cagtcctttt cttctgggcc gactttccgg 60 
ttgcgccggc ggggttttgg aatcgtgtgt 120 
accaccacca cacgcacgac gaccactgcg 180 
gttgagcatg gaagtagtca ttggggacct 240 
aagcacagct ttacagggga gacaatggaa 300 

329 



<400> 34 

gatgttgtgc ccgaacttgc catgctccag 
gtgcttaaac gaaaaacccc gcccggtttg 
tccagtccct acggcgcatt gccgacgacc 
gatggcgcag ttggtagtag ttttgcgttg 
ggttttgtgt ttaacaatat ttatacaagc 
tttcagcaag ggaaaattga ggggttgatc 
aaccccgccg gagcgaggtt tcgtcagtcg 
attatgtcaa cactgaaagt at 



tccttttctt ctgggccgac tttccgggag 60 
cgccggcggg gttttggaat cgtgtgttgt 120 
accaccacac gcacgacgac cactgcggcg 180 
agcatggaag tagtcattgg ggaccttgtt 240 
acagctttac aggggagaca atggaaaatt 300 
acgttttgta ctgaattgca gataacaaaa 360 
cctgcggctg gtaaccgcaa agcacactgt 420 

442 



<210> 35 
<211> 272 
<212> DNA 

<213> Escherichia coli 



<400> 35 

cgattatagg gctgaccaaa tacatggata 
gtaaaaagag catatacaag gagaatagaa 
gttttactct attaacacct acgatgaatg 
gcttaccagt attcagtata attgcaaaga 
ttttacttac ctttgtatta gtaaattttt 

<210> 36 
<211> 255 
<212> DNA 

<213> Escherichia coli 



gataatatgc catcccccca ggcagtccac 60 
cacctacagc tgtaataaga taaacataah 120 
cataagagat gataaccata aagacaattt 180 
tgaatacaat aaaaacaagt aaagtaaatg 240 
ta 272 



<400> 36 

tgcgattacg gcatgcttat gatcaggaat 
ctcctactat ttaatatgta cgttccatgc 
gaaacgcaca gacatacaaa agattggctg 
aagaaaaata tgcgaaaaaa tcagccttgg 
atcacacgga tgaca 

<210> 37 
<211> 389 
<212> DNA 

<213> Escherichia coli 



gttaatgcct gctatacggg ccactatgca 60 
tgaaaagccc gttttcagga tactcaaatg 120 
gctaatctag ccagctcaac ccaactttgc 180 
cgctgtttat gcttcggctc ggcactgcaa 240 

255 
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<400> 37 

aacgtaaaac tttacgtaaa ttaacatggt 
catatttcgt aatgcttctg aatttttcgt 
cttgagggga cggcctattt ataaaatacg 
gtatgattct ggctggtcgt tgagtatcaa 
tcgtttattt taccacccat tctcttttcg 
atatccatat ccagaacttc agcaatctga 
attatggtga attccatctg agttaatgt 



taacatttat gccactattg tttgtaaatt 60 
gtgatggttt taaatactat ggtgtttact 120 
gacatttcaa taaatgcccg tataaacaga 180 
tgttggaccg aatgtgaacg aataaataat 240 
attcgttgct tgtacgcgta gatgctacgg 300 
tatggattat aaccagaaac catatacccc 360 

389 



<210> 38 
<211> 178 
<212> DNA 

<213> Escherichia coli 



<400> 38 

gtggttagga tctttatagt tacgctcggc ggcatccatc gggatacctg cggcattttc 60 
tttggcaaaa ccgatttcag aattgtgttt gtttggatga acctgaatgg agagtggctg 120 
tgctgcgcat aatactttga acaggaaagg cagttcgcca aagcgtttgg caacggcc 178 

<210> 39 
<211> 191 
<212> DNA 

<213> Escherichia coli 



<400> 39 

tcgcaaggaa aggcgtcagc gcaaaaacca 
tacgctcggc ggcatccatc gggatacctg 
aattgtgttt gtttggatga acctgaatgg 
acaggaaagg c 

<210> 40 
<211> 158 
<212> DNA 

<213> Escherichia coli 



gctccggctt gtggttagga tctttatagt 60 
cggcattttc tttggcaaaa ccgatttcag 120 
agagtggctg tgctgcgcat aatactttga 180 

191 



<400> 40 

ccggcttgtg gttaggatct ttatagttac 
cattttcttt ggcaaaaccg atttcagaat 
gtggctgtgc tgcgcataat actttgaaca 



gctcggcggc atccatcggg atacctgcgg 60 
tgtgtttgtt tggatgaacc tgaatggaga 120 
gaaaaggc 158 



<210> 41 
<211> 184 
<212> DNA 

<213> Escherichia coli 



<400> 41 

tcgaagaatt taacggaggg taaaaaaacc 
gtttcgtaat tagatagcca ccggcgcttt 
ctcaaagtct tcgaaacggt agtcgaagat 
cggc 

<210> 42 
<211> 122 
<212> DNA 

<213> Escherichia coli 



gacgcacact ggcgtcggct ctggcaggat 60 
aatgcccgga tgcggatcgt agccttcaat 120 
ggattcgggt ttacgtttga taatcaactt 180 

184 



<400> 42 

gtagaagttc aaattacgaa cctgaatttt 
aatcttttcc tcatcgtgcc tgatgcactt 

gg 



actcggggca gtttcaacca tactcattgc 60 
cgctaatcag gcttaccgta ggccggatca 120 

122 
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<210> 43 
<211> 163 
<212> DNA 

<213> Escherichia coli 
<400> 43 

taaattccgc gtatagcgtg ggcgctttgg ctccggtcgc ttcgacgagg tccgcaagag 60 
aagtcgcttc ataaccgtgt tgccagaaca atttcatggc cttatcaagc gcggcatccc 120 . 
tgtcgaacac ttttgggcgg ccacggcttt tttttacaca ttg 163 

<210> 44 
<211> 161 
<212> DNA 

<213> Escherichia coli 
<400> 44 

gcaaatgttt tcgtaattca ggagaaatgg caaataaagc attaaaaatt tgaatgcttt 60 
gtgtaataaa aaagcagaca ggcgacggag tgaccactcc gtcgctttac aaagagagga 120 
aaatcatagg ttgccggtgt agtgccagcg taaataacgc a 161 

<210> 45 
<211> 251 
<212> DNA 

<213> Escherichia coli 
<400> 45 

ggctgtaacg gtaaagccct caccgaagcg 
cttgtcatca tactgattgt actgttactc 
tcaggggagg agcaatcctc ccttaccctt 
tgtcaatcag ggcgatgcgg ttgtatcgcc 
ccaccgcagg a 

<210> 46 
<211> 286 
<212> DNA 

<213> Escherichia coli 
<400> 46 

caccgaagcg agggcttgaa ggagaagggt tatgatgcga cttgtcatca tactgattgt 60 
actgttactc ataagtttca gcgcttatta acagtcagtc tcaggggagg agcaatcctc 120 
ccttaccctt actcactaaa ttaggtcaaa gaatcaacga tgtcaatcag ggcgatgcgg 180 
gtgtatcgcc cttaccactc ccagactttc gacggtgtaa ccaccgcagg aagagggata 240 
tcccactctt caacggggag tttttccacc aactgacaat catgcg 286 

<210> 47 
<211> 180 
<212> DNA 

<213> Escherichia coli 
<400> 47 

ccggggctga cgtgggcgat aatcgggtcg ccaggaatag ggcggcagca tttcgcaaag 60 
gtgatcagca cgccatcggc acctttaatg ggcagatgtc cgtggctttg ggttgccggt 120 
ggaatggagg cgtccccatg ttgcagattt ttcgcgacca ccacgctcat tgcgttacca 180 



<210> 48 
<211> 254 
<212> DNA 

<213> Escherichia coli 
<400> 48 



agggcttgaa ggagaagggt tatgatgcga 60 
ataagtttca gcgcttatta acagtcagtc 120 
actcactaaa ttaggtcaaa gaatcaacga 180 
cttaccactc ccagactttc gacggtgtaa 240 

251 
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tgtcgagcat acgcggcagc gccgactcgg 
cttcacggat gtagcggata aatttgaaga 
ccaataccag caccacaaac aggatacagg 
gttgcaccag tgtgccgacg ccgtatttac 
caataggtgc caga 

<210> 49 
<211> 300 
<212> DNA 

<213> Escherichia coli 



aagatgaagt ccccagtaca atcagcagtt 60 
tactgaaacc agtcgcttta gcgattgaac 120 
taatgtagaa acagataatc agctgcccca 180 
cgatggtaaa cgccattgcc ccgaacgcac 240 

254 



<220> 

<221> misc_feature 
<222> (1) . . . (300) 
<223> n = A,T,C or G 



<400> 49 

agctgatcga tggtgtttgc caacgcacac 
ccgccgagca ccaccggctt acgcccgaag 
cccaacgcaa agcccagaat ataagtactg 
ccaaactgcg ctgaaattac cggtagcgcg 
aacatggcca gcaggccaag gataaaaaca 



gccaccgcgg cggcggcaaa caccancgta 60 
ctgtctgcca tcggcccgta gattaactgc 120 
agggtcatct gcgtactgcc cgncggtacg 180 
ggcagataca tatcaatcga cagcggcatc 240 
atagcaaacg acgaatgctg tcgggtggtc 300 



<210> 50 
<211> 207 
<212> DNA 

<213> Escherichia coli 



<400> 50 

caacatctca tgtctggatt tgtagatata 
attcaacata tcatgtaagc gctgggagct 
ggagaacagg ataaagatat tatcgtttcg 
attatcttca gtttctataa atatagg 



cgtggaattc caattggtat cttttgtagg 60 
taccaccata actggcccat catcgactga 120 
tgttaattca ttcgtgttaa aggttcttaa 180 

207 



<210> 51 
<211> 213 
<212> DNA 

<213> Escherichia coli 
<400> 51 

ttctgtaatc agaaaaagaa gaactggaat 
ggggtataat caatatgacg agttgaatat 
aagaaatctg ccgtaaagac agatttcttt 
ataaactttt tgtacatcgt cgtcatcttc 



tttaataaat attatttctc tgggaacggg 60 
tatatttttt ataagaatta tatgagatta 120 
aaaagataat tagagatttg cgacgttatg 1B0 
aag 213 



<210> 52 
<211> 381 
<212> DNA 

<213> Escherichia coli 
<400> 52 

tctgttagtg tattatccac tgcggccctt 
aggaaaggat gttccgtggc cgtaaatgca 
tatcgccagt ggtgctgtcg tgatgcggtc 
gcttttgtat cgtacttatt gtttctggtg 
gtctctccgg cgacaattta ctggtggtta 
tgccgccctt tgggcggcat attttagatt, 
ccaccccagc gacgttcata g 



tccgccgtct cgcaaacggg cgctggcttt 60 
ggtgtttcac agcgcttgct atcgcggcaa 120 
ttcgcatgga ccgcacaatg aagatacggt 180 
cgctgttaac cgaggtaaat aataaccgga 240 
acaaccttca gagcagcaag taagcccgaa 300 
atccgattct gtttaaagtc acgcaaaaaa 360 

381 



<210> 53 
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<211> 154 
<212> DNA 

<213> Escherichia coli 



<400> 53 

aaatagaccg gcatactttc gtgcggcagc 
atgataaagg tgtagcatag atatttactt 
ccaccaatcg gtccaccaat cattttcaga 



ataatcaaca gtacgagcgc agcggtacta 60 
ggcgatttca ggatcttatc tgaaatcatg 120 
cagt 154 



<210> 54 
<211> 191 
<212> DNA 

<213> Escherichia coli 



<400> 54 

tccttagtca ggtactgacg tactttaaaa 
gtgttcgcaa accaggtaga gttccatggt 
tgtactttct gacccattgc tagtctccag 
tgtggctggt g 

<210> 55 
<211> 190 
<212> DNA 

<213> Escherichia coli 



tcgctgtcca ggttgtcagc gaattctttg 60 
tttacaatac ccaggcgaat accattagga 120 
agtctcagcg atcggacaca accacagtga 180 

191 



<400> 55 

gctggtgccc gttttccatg ccagtggggc 
ttgcgcttca tcagccataa tccggcgaat 
ccgttcaagc agcggatcgt caggctgtaa 
aaacgcggta 

<210> 56 
<211> 402 
<212> DNA 

<213> Escherichia coli 



gacgcgcggc aaggcactat ccggcaaggg 60 
gatccacgcc gcccccgacg acattaaagg 120 
gcgcaatttg cctgccttgc cgtggcgagc 180 

190 



<400> 56 

aaaaaatgaa attcctcttt gacgggccaa 
atttgcggca aattcccttc tccatacagg 
cccggcggca tggcgcgtaa tttatcccgc 
tcgtacagcg actgactgaa cgtattgggc 
ttcagccacc agctcatcat ttgttgcagc 
gtatcgtcca gcagcgccag tcctttgtca 
tcagcccctt ccagttcaat cacgggttta 

<210> 57 
<211> 595 
<212> DNA 

<213> Escherichia coli 



tagcgatatt ggccattttt ttagcgcaac 60 
tgtagtgcac cgaccgcgac cacatatcgc 120 
caggcgagat ttcgctgatg catcagcaca 180 
agcgttatat cattattttg cggcggtgca 240 
aaccgtgcgt tggtatgcca gtgggtcagc 300 
gggagctgga gcaacatggc aatctggttt 360 
tgttgttgct tc 402 



<220> 

<221> raisc_feature 
<222> (1)...(595) 
<223> n « A,T,C or G 

<400> 57 

aattaatcag agcaacggta aaacaatgaa 
gagaaaaaaa gaaaatttga tggagagtga 
agagattatg aagaactata ccggatgact 
atctgtatta atttataaga aagcaactta 
attagcttat tttttcgagc attaatcccg 



agtgtaaaaa acacttttgc gccaattatg 60 
tgagagaata ttacaacacg atgattttgc 120 
ggtgataaat aaagcaaata accaggatta 180 
atacccgcag aatgatttct gcgggtaagt 240 
cgcgtaatcc caacgctacc aacggattag 300 
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ggaataacac atactctaca tcatgggtta 
gcaatgttcc tttctcaaac ggcataaaat 
aggacggcga gtggcgagta atttgcgaaa 
cgataccgac actctcacca gatagcagcg 
ggcgaagatc gttttgccca aagggcaacg 



cggtaaaacg ttcctctccg tcctgcgcca 360 
tcagcgtgtc acttgccata tgcatttcga 420 
ccacccgata acggagcggc ggtgttctca 480 
cagcaattgc gctggcagtt actgcaaact 540 
ctttgccaag ttncagccgt acang 595 



<210> 58 
<211> 250 
<212> DNA 

<213> Escherichia coli 



<400> 58 

gaaactcagt ggaatggggg agccgcaata 
tgcgcggtga atatgcccga gcgcgatgta 
cgcgtccagc gtgccaatat aaatgtcacg 
cgttaaatgt cccgtggcga tgatgggcag 
ggcatagtgt 



gcgaacatgt tccatgccgc caataatctg 60 
gtcggctggt ggaaagtttt gtgccggaaa 120 
cacggcgtca cttttactgg cccccacggt 180 
aggctgatcg ccgcgcagtt tgcaggcatc 240 

250 



<210> 59 
<211> 236 
<212> DNA 

<213> Escherichia coli 



<400> 59 

aaaggttcgg caaaagccat aaccttttcg 
gacgtttcga ccattttata acgggtaaaa 
ctgccaaaaa aacgcatatc tgtacactga 
tgaccgcacg gtttagacgg attaatggct 

<210> 60 
<211> 92 
<212> DNA 

<213> Escherichia coli 



tcagctttgg ccggataagg gacacattgc 60 
ccaccatcaa catgagggaa atacatggca 120 
ttctcgttat gttcaatgca gtatttgcag 180 
accgtttgcc cttcatgtaa ttctga 236 



<400> 60 

gaagagatgt tcaggttttc gttatcggca atggtatcga acttgtattt ctcatacttc 60 
tcgtcaggcg tggagtacgc cgcgccacgg aa 92 

<210> 61 
<211> 62 
<212> DNA 

<213> Escherichia coli 



<400> 61 

tgtcgacatt cagcacttcc gggtattcgt cgcgcagggc aaatgtacag gttgaggagg 60 
tg 62 



<210> 62 
<211> 72 
<212> DNA 

<213> Escherichia coli 



<400> 62 

acttatcaaa ccatttttcc gcttcaccgg aggtctgcac ctgagcgatg gtgtcatcca 60 
tcagcttttt ga 72 

<210> 63 
<211> 66 
<212> DNA 

<213> Escherichia coli 
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<400> 63 

aaacgcatgg cgcgtaaagg ttcaatcagt ccgatttcag cggtgtcgaa ctcgctaaat 60 
tcttca 66 



<210> 64 
<211> 143 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc_feature 
<222> (1)...(143) 
<223> n =5 A, T, C or 6 

<400> 64 

tataggggnc tatttcatta acccgacact ggattaattc aacaggaata ttatgattct 60 

gacacaactc tttaatttct tccgccgcca tcgtagaggt cgcaaccgcg cctccgcaag 120 

cgacaataat cttgcgtttc ata 143 

<210> 65 
<211> 210 
<212> DNA 

<213> Escherichia coli 



<400> 65 

gaggaaccgt gtgacacaca aagggtgatt 
accatcatgt attttcatgg ctaaaatact 
tgctattgga gtggttaatg cattcagaaa 
catggagata taaagtccag gtgtttcgtt 

<210> 66 
<211> 118 
<212> DNA 

<213> Escherichia coli 



attttccatt gcggcgtata atcctggctc 60 
catcaccagt tttttgttta aaagctcttg 120 
gatatcgatt atttcatcga atccatacaa 180 

210 



<400> 66 

agagctgtga aggagaaaat acgaccacct ttaacggttt tagatacgcg gtttaccgcg 60 
atcagctttt cctgcagttc gccagcttgt ttttcgatgt gagccatctt acacctct 118 

<210> 67 
<211> 531 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> mis cofeature 
<222> (1) . . . (531) 
<223> n = A,T/C or G 



<400> 67 

cctttcaacg cagtagaggg gaatgncncc 
cagcatatag- cccattccgt cttgatcggg 
aacctggtcg caggtggcgt aattgaggcc 
tttttgcgtg acggcaacaa agcggtcgcc 
ttcggtattg tccgtggagc cgtcgttaac 
cagtaantta tccagtgact ggcgccagac 
cgtcnaatac agcctttttt tttantgggt 
gcaaccattg ttttngacct tcanaatnaa 
gcaacttgnt ggacagggca aaaaattcct 



tacggcatcg gcaccgcgct caatttctgc 60 
cggaacatag gtgtcggcat cacttaaaaa 120 
attcatcagc gcaccacctt taccggtatt 180 
ccatttgcgt ttgacttccg ccatgaccgc 240 
gcaaattacc cggcaaaaat aangggttcg 300 
acgggccttt tttntncgca gntttaatgg 360 
ttaccntnta nccgtnttta aaaacccata 420 
aaattcnggg ttttaaaagc gcgttttccg 480 
gcctgggtcg ccgnatctgn g 531 



<210> 68 
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<211> 102 
<212> DNA 

<213> Escherichia coli 
<400> 68 

agcgccggta acgcccgtta aatgttctcg ttcggcttcc cgatcgttat tcaacaggct 60 
gcgtaccgta gcggcatatt tcttccctgc ttcatcgcca tc 102 

<210> 69 
<211> 167 
<212> DNA 

<213> Escherichia coli 



<400> 69 

aaaactcacc ttttttgttg ttatccctca 
aatgacctag acgtaattat gtaaaaatac 
cagctgttga aaaatcaaaa aactggaaaa 



attaccacaa gtaaaaggag ataaaccgac 60 
tgatgttcgt cactgactat ttccgatatt 120 
ataattttaa atatcat 167 



<210> 70 
<211> 83 
<212> DNA 

<213> Escherichia coli 
<400> 70 

gccacgccca gcataaacag cgggatcgag ccttcaatcg gtacgcccag tacacctttc 60 
accatcagca ccagcgataa tec 83 

<210> 71 
<211> 103 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> mis cofeature 
<222> (1) . . . (103) 
<223> n as A,T,C or G 

<400> 71 

gecnggeggg aaccgacatg ttcccactct acgccggctg gcgcggtttt catttttgeg 60 
cacctgatag ccgcacgatt gcagatcaac aacgectttt ttg 103 



<210> 72 
<211> 121 
<212> DNA 

<213> Escherichia coli 



<400> 72 

tagcttcttc cacattgtcg aactgaacgt ecagaeggtg tttagaaacg ttgatgeteg 60 
ggattttttc cgcacccagg caaacgcgcg ccatggctgc aacggtttct gaeggatact 120 
g 121 

<210> 73 
<211> 163 
<212> DNA 

<213> Escherichia coli 



<400> 73 

tagttattgg ctgttttgag aatgtaatct 
aatccgagaa teacatgeaa tttattcata 
gcatagttga ccagaccgtc gagtegctge 



tccaactgct tataactctt cagatgeaat 60 
aattcgtggg atcgttcacg aagtgcgtca 120 
atcagtttac gta 163 
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<210> 74 
<211> 106 
<212> DNA 

<213> Escherichia coli 
<400> 74 

aaaaccatga ggttattatg gccgatttga ggagggaaag agtaagagca gtttgttaaa 60 
tgtacaacga cgattctccc accgggcgcg ttttaaagcg acggtg 106 



<210> 75 
<211> 319 
<212> DNA 

<213> Escherichia coli 



<400> 75 

ctggagattg agtagatatt cttgttcaga 
aagccacgaa gagttcagat agtacaacgg 
cagtgtgttc tctcacgcat cacaaaagca 
ttttcacaaa gcttcagcaa attggcgatt 
tagccttatg acgtgcagct ttgtttttgt 
cgatcggttg catttcgtt 

<210> 76 
<211> 237 
<212> DNA 

<213> Escherichia coli 



atgtatcagc ccgatggttc tacgattctt 60 
catgtctctt ttgactatct ggcaacccgg 120 
gcaggcataa aaaaacccgc ttgcgcggct 180 
aagccagttfc gttgatctgt gcagtcaggt 240 
ggatcaaacc tttagcagcc tgacggtcca 300 

319 



<400> 76 

cgaaccaacg acccccacca tgtcaaggtg 
cgttgtaggt gacaacgggg acgaatatta 
aacgtaattt tctgcgctat ttcgaccgtt 
ttgttgattt gcaagggtga tttttattca 



gtgctctaac caactgagct atgaacgcaa 60 
gcggcagagt gggaaggtgg caagaggcaa 120 
tgcagagctt ttaagcaaat tggctatatt 180 
ggatcgcatt tacatctgat acaaccc 237 



<210> 77 
<211> 241 
<212> DNA 

<213> Escherichia coli 



<400> 77 

tgtgacagag tggttatcgt taatcaggca 
atagcgcata aaatatcctt ataattaaca 
attttctacg actgtcaaaa atcgctcatt 
ggaaaaggtg atgcatttgg gagaggaaga 

g 



aagagggaaa aacatattat ttaaaccatt 60 
agaaagaaaa ggcatattct ctgcattatc 120 
ttttaatgag tttatttgtt taatattatg 180 
gtattccccg gtcagacgac cggggaaggg 240 

241 



<210> 78 
<211> 89 
<212> DNA 

<213> Escherichia coli 



<400> 78 

cccggctggg ttttttcaag tttacgcagg ggtgcgggcg atcacttccg ggttgctggc 60 
agcttcgcct ttcggcgaaa tgctgtggt 89 

<210> 79 
<211> 140 
<212> DNA 

<213> Escherichia coli 



<400> 79 

tcgcgaacac cgtagtggat gtagttaccc gcagcatctt cgttgattgc tttagaacca 60 
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gaccacaggg tcaggttaga cggcgccagg tcagcagaac cgccgaggaa ttccggcaac 120 
agcggaccga acgcttcgat 140 

<210> 80 
<211> 189 
<212> DNA 

<213> Escherichia coli 
<400> 80 

gttagtaaag ccgatgaggt gagcagtcac ttcgccggac ggatagtaac ggcgagactc 60 
ttcacgcaga tgaatccccg gcagtttcag ttttttgatg tagtccgcca tgtcagggtt 120 
cacctgacgc gccagataaa taaagcgccc tttcgggttg gcgttaatgc gggctgaaag 180 
ctgatccag 189 

<210> 81 
<211> 347 
<212> DNA 

<213> Escherichia coli 
<400> 81 

gcagttcacc ggcacgcgct ttcttatata agcctttggg atcgcgggct tcgcaaatcg 60 
ccagcggcgt atcgacaaac acttcgataa agcgcccttc tcctacgcgt tcgcgaacca 120 
tctggcgttc ggcgcggtgt ggcgagataa atgcggtcag caccaccagt ccggcttcaa 180 
ccatcaaatt cgccacttca ccgacgcgac ggatattctc tttacgatcg gcatcgctaa 240 
aaccgagatc gctgcataat ccgtggcgaa cattgtcgcc atccagcaga tacgtactga 300 
cgccgagttt atgtaacgcc tcctccagcg ccccggcgac cgttgat 347 

<210> 82 
<211> 67 
<212> DNA. 

<213> Escherichia coli 
<400> 82 

ccctgacgat cagtgatagt cacgatggtg ttgttgaaag aagcatggat atgagccacg 60 
ccgtcag 67 

<210> 83 
<211> 176 
<212> DNA 

<213> Escherichia coli 
<400> 83 

agcaaagcga gccgaacatg gtggcgaggc tgccgtaaag taaaaactgg gtcgccaact 60 
ccgggtgatc catcacgtat ttcacgaagt agagcgtcgc cccgccgcgc accacgttgg 120 
agcaggtcgc catcatcttg aacgcgcaca tgatgcgcca ctggctgttg cccagc 176 

<210> 84 
<211> 632 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (632) 
<223> n = A,T,C or G 

<400> 84 

tnncannnnc nnnnnrmTmn nnrinrmcncn ntncctagan tntcnangtc tncncncncn 60 
tnnccnnncc cttcnnrmcc nnnnanangn ncrmcncccc cxmncccnnt tttgttttgg 120 
aaantccgtt tcaggaanag cttctgaatc ccgancgntt gataactgtm ggccagagtc 180 
ataatgcgca ccaataataa tcagcggncc atcggcagga ccataatcag caacaatgnn 240 
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nnggtagggg ccacccgtaa tggggaccgt 
caaagacctc ttttatatat tctgcgggac 
acgtggatgc acaagtttgt gtaagataac 
gcgtgtcggg gccgttgacg gtaatgcatt 
gagtaaaaac acaaataata taataaaagc 
taatggggtt gcgtttggtg aatcattatn 
ttaaagatta agcangggat tataccaaaa 



cctgcgaggt aactctggca ccgctactga 300 
ctattcagat tatcaatatt gtcnggcact 360 
gaactgnttt ttctaattgc tcaggacttt 420 
gacccagggt tggtaaaaaa taatcatagg 480 
caagattatt tttttcatat gcaaaattat 540 
ttgctatggg tttcgatatt gngattttaa 600 
ga 632 



<210> 85 
<211> 161 
<212> DNA 

<213> Bscbierichia coli 



<400> 85 

gcgctattca gcgtaattgg cagcagctgc ggatcgtcat caatcttctc tgacagcggc 60 
tgatgtactt caaccagacg cgcaccgttt ggttcggcag agacttttat cggagtgttg 120 
atgatattca ctttggtgcc tggggtgacc tggctaaaga g 161 

<210> 86 
<211> 188 
<212> DNA 

<213> Escherichia coli 



<400> 86 

caagggtgag catttgcagc gaatcagaaa gggcgcagaa cagtgaagac aacagaaaaa 60 
cgaccagacc gcafcttataa atacgfccgat agccaaacat atcgcccaga aacgaaaacg 120 
agagcaggga gatgacaatg gcgatttgat aggcgttcac tacccagatg gaactggctg 180 
gcgtggca 188 

<210> 87 
<211> 175 
<212> DNA 

<213> Escherichia coli 



<400> 87 

tgagcatttg cagcgaatca gaaagggcgc 
gaccgcattt ataaatacgt cgatagccaa 
gggagatgac aatggcgatt tgataggcgt 



agaacagtga agacaacaga aaaacgacca 60 
acatatcgcc cagaaacgaa aacgagagca 12 0 
tcactaccca gatggaactg gctgg 175 



<210> 88 
<211> 194 
<212> DNA 

<213> Escherichia coli 



<400> 86 

cgcaagggtg agcatttgca gcgaatcaga 
aacgaccaga ccgcatttat aaatacgtcg 
cgagagcagg gagatgacaa tggcgatttg 
tggcgtggca tgaa 



aagggcgcag aacagtgaag acaacagaaa 60 
atagccaaac atatcgccca gaaacgaaaa 120 
ataggcgttc actacccaga tggaactggc 180 

194 



<210> 89 
<211> 272 
<212> DNA 

<213> Escherichia coli 



<400> 89 

ccttgtataa ggaaagggtt atgatgaagc 
taagtttccc gacttactaa caactcatca 
tcctttactc taggttgaaa aaacaacagc 
tctgtgaacc gctttccggt tagccttttt 
tcgtcgagca accgtcttca aagttgagca 



teg tea teat actggttgtg ttgttactgt 60 
gaggggggag aaatcctccc ttacccttgt 120 
gtcaataggc ctgccatgta egaagegaga 180 
tatcctgttg gatcttcttg atgatgttgg 240 
cc 272 
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<210> 90 
<211> 245 
<212> DMA 

<213> Escherichia coli 



<400> 90 

ccttgtataa ggaaagggtt atgatgaagc 
taagtttccc gacttactaa caactcatca 
tcctttactc taggttgaaa aaacaacagc 
tctgtgaacc gctttccggt tagccttttt 
tcgtc 

<21Q> 91 
<211> 203 
<212> DNA 

<213> Escherichia coli 



tcgtcatcat actggttgtg ttgttactgt 60 
9 a 9gg9ggag aaatcctccc ttacccttgt 120 
gtcaataggc ctgccatgta cgaagcgaga 180 
tatcctgttg gatcttcttg atgatgttgg 240 

245 



<400> 91 

taccaaaaaa agccacgtta tcttgttgat 
ccagttatat cagtagaaaa cctggttgtt 
atttttttga taaaaattaa attttatttg 
ccagggctgg aacctgaagt tta 

<210> 92 
<211> 189 
<212> DNA 

<213> Escherichia coli 



gcaaaagagt gaacgtggcg ttaaatgtaa 60 
gttaacagtc taaccggtca attttttatg 120 
ctttaatcac caccagatga cgttcgccat 180 

203 



<220> 

<22l> misc_feature 
<222> (1) . . . (189) 
<223> n = A,T,C or G 



<400> 92 

ctaatgactt attttctgct taccaaaaaa 
gaacgtggcg ttaaatgtac cagttatatc 
aaccggcaat tttttatgat ttttttgata 
ccagatgac 



agccacgtta tcttgttgat gcnaaagagt 60 
agtagaaaac ctggttgntg ntaacagtct 120 
aaaattaaat tttatttgct ttaat caeca 180 

189 



<210> 93 
<211> 221 
<212> DNA 

<213> Escherichia coli 



<400> 93 

agegcaacag eggaaaccag ggtagccagt 
agtcgttatt actgettttt gttgtctacc 
acgatcatct ggtaatagga ggaaacacct 
agaattaatg cgccgatcaa cgtcccaaca 

<210> 94 
<211> 117 
<212> DNA 

<213> Escherichia coli 



tttttcatgt tcatattcaa gatgtcctgt 60 
agcaccgcca gcaaaatcac caccgctttg 120 
aacaaattca atccattatt aaggaageca 180 
atgegacett t 221 



<220> 

<221> CDS 

<222> (1) . . . (117) 



<400> 94 
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atg aaa gtt cgt get tec gtc aag aaa tta tgc cgt aac tgc aaa ate 48 
Met Lys Val Arg Ala Ser Val Lys Lys Leu Cys Arg Asn Cya Lys lie 
15 10 15 

gtt aag cgt gat ggt gtc ate cgt gtg att tgc agt gee gag ccg aag 96 
Val Lys Arg Asp Gly Val lie Arg Val lie Cys Ser Ala Glu Pro Lys 
20 25 30 

cat aaa cag cgc caa ggc tga 117 
His Lys Gin Arg Gin Gly * 
35 



<210> 95 
<211> 1332 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1332) 

<400> 95 

atg get aaa caa ccg gga tta gat ttt caa agt gee aaa ggt ggc tta 48 
Met Ala Lys Gin Pro Gly Leu Asp Phe Gin Ser Ala Lys Gly Gly Leu 
15 10 15 

ggc gag ctg aaa cgc aga ctg ctg ttt gtt ate ggt gcg ctg att gtg 96 
Gly Glu Leu Lys Arg Arg Leu Leu Phe Val lie Gly Ala Leu lie Val 
20 25 30 

ttc cgt att ggc tct ttt att ccg ate cct ggt att gat gee get gta 144 
Phe Arg lie Gly Ser Phe lie Pro lie Pro Gly lie Asp Ala Ala Val , 
35 40 45 

ctt gec aaa ctg ctt gag caa cag cga ggc ace ate att gag atg ttt 192 
Leu Ala Lys Leu Leu Glu Gin Gin Arg Gly Thr lie He Glu Met Phe 
50 55 60 

aac atg ttc tct ggt ggt get etc age cgt get tct ate ttt get ctg 240 
Asn Met Phe Ser Gly Gly Ala Leu Ser Arg Ala Ser He Phe Ala Leu 
65 70 75 80 

ggg ate atg ccg tat att teg gcg teg ate att ate cag ctg ctg acg 288 
Gly lie Met Pro Tyr He Ser Ala Ser He He He Gin Leu Leu Thr 
85 90 95 

gtg gtt cac cca acg ttg gca gaa att aag aaa gaa ggg gag tct ggt 336 
Val Val His Pro Thr Leu Ala Glu He Lys Lys Glu Gly Glu Ser Gly 
100 105 110 

cgt cgt aag ate age cag tac acc cgc tac ggt act ctg gtg ctg gca 384 
Arg Arg Lys He Ser Gin Tyr Thr Arg Tyr Gly Thr Leu Val Leu Ala 
115 ^ 120 125 

ata ttc cag teg ate ggt att get acc ggt ctg ccg aat atg cct ggt 432 
He Phe Gin Ser He Gly He Ala Thr Gly Leu Pro Asn Met Pro Gly 
130 135 140 

atg caa ggc ctg gtg att aac ccg ggc ttt gca ttc tac ttc acc get 480 
Met Gin Gly Leu Val He Asn Pro Gly Phe Ala Phe Tyr Phe Thr Ala 
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145 150 155 160 

gtt gta agt ctg gtc aca gga acc atg ttc ctg atg tgg ttg ggc gaa 528 
Val Val Ser Leu Val Thr Gly Thr Met Phe Leu Met Trp Leu Gly Glu 
165 170 175 

cag att act gaa cga ggt ate ggc aac ggt att tea ate att ate ttc 576 
Gin He Thr Glu Arg Gly He Gly Asn Gly He Ser He He He Phe 
180 185 190 

gee ggt att gtc gcg gga etc ccg cca gee att gee cat act ate gag 624 
Ala Gly He Val Ala Gly Leu Pro Pro Ala He Ala His Thr lie' Glu 
195 200 205 

caa gcg cgt caa ggc gac ctg cac ttc etc gtg ttg ctg ttg gtt gca 672 
Gin Ala Arg Gin Gly Asp Leu His Phe Leu Val Leu Leu Leu Val Ala 
210 215 220 

gta tta gta ttt gca gtg acg ttc ttt gtt gta ttt gtt gag cgt ggt 720 
Val Leu Val Phe Ala Val Thr Phe Phe Val Val Phe Val Glu Arg Gly 
225 230 235 240 

caa cgc cgc att gtg gta aac tac gcg aaa cgt cag caa ggt cgt cgt 768 
Gin Arg Arg He Val Val Asn Tyr Ala Lys Arg Gin Gin Gly Arg Arg 
245 250 255 

gtc tat get gca cag age aca cat tta ccg ctg aaa gtg aat atg gcg 816 
Val Tyr Ala Ala Gin Ser Thr His Leu Pro Leu Lys Val Asn Met Ala 
260 265 270 

ggg gta ate ccg gca ate ttc get tec agt att att ctg ttc ccg gcg 864 
Gly Val He Pro Ala He Phe Ala Ser Ser He He Leu Phe Pro Ala 
275 2B0 285 

acc ate gcg tea tgg ttc ggg ggc ggt act ggt tgg aac tgg ctg aca 912 
Thr He Ala Ser Trp Phe Gly Gly Gly Thr Gly Trp Asn Trp Leu Thr 
290 295 300 

aca att teg ctg tat ttg cag cct ggg caa ccg ctt tat gtg tta etc 960 
Thr He Ser Leu Tyr Leu Gin Pro Gly Gin Pro Leu Tyr Val Leu Leu 
305 310 315 320 

tat gcg tct gca ate ate ttc ttc tgt ttc ttc tac acg gcg ttg gtt 1008 
Tyr Ala Ser Ala He He Phe Phe Cys Phe Phe Tyr Thr Ala Leu Val 
325 330 335 

ttc aac ccg cgt gaa aca gca gat aac ctg aag aag tec ggt gca ttt 1056 
Phe Asn Pro Arg Glu Thr Ala Asp Asn Leu Lys Lys Ser Gly Ala Phe 
340 345 350 

gta cca gga att cgt ccg gga gag caa acg gcg aag tat ate gat aaa 1104 
Val Pro Gly He Arg Pro Gly Glu Gin Thr Ala Lys Tyr He Asp Lys 
355 360 365 

gta atg acc cgc ctg acc ctg gtt ggt gcg ctg tat att acc ttt ate 1152 
Val Met Thr Arg Leu Thr Leu Val Gly Ala Leu Tyr He Thr Phe He 
370 375 380 

tgc ctg ate ccg gag ttc atg cgt gat gca atg aaa gta ccg ttc tac 1200 
Cys Leu He Pro Glu Phe Met Arg Asp Ala Met Lys Val Pro Phe Tyr 
385 390 395 400 
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ttc ggt ggg acc tea ctg ctt ate gtt gtt gtc gtg att atg gac ttt 1248 
Phe Gly Gly Thr Ser Leu Leu lie Val Val Val Val lie Met Asp Phe 
405 410* 415 

atg get caa gtg caa act ctg atg atg tec agt cag tat gag tct gca 1296 
Met Ala Gin Val Gin Thr Leu Met Met Ser Ser Gin Tyr Glu Ser Ala 
420 425 430 

ttg aag aag gcg aac ctg aaa ggc tac ggc cga taa 1332 
Leu Lys Lys Ala Asn Leu Lys Gly Tyr Gly Arg * 
435 440 



<210> 96 
<211> 435 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (435) 

<400> 96 

atg cgt tta aat act ctg tct ccg gec gaa ggc tec aaa aag gcg ggt 48 

Met Arg Leu Asn Thr Leu Ser Pro Ala Glu Gly Ser Lys Lys Ala Gly 
15 10 15 

aaa cgc ctg ggt cgt ggt ate ggt tct ggc etc ggt aaa acc ggt ggt 96 
Lys Arg Leu Gly Arg Gly lie Gly Ser Gly Leu Gly Lys Thr Gly Gly 
20 25 30 

cgt ggt cac aaa ggt cag aag tct cgt tct ggc ggt ggc gta cgt cgc 144 
Arg Gly His Lys Gly Gin Lys Ser Arg Ser Gly Gly Gly Val Arg Arg 
35 40 45 

ggt ttc gag ggt ggt cag atg cct ctg tac cgt cgt ctg ccg aaa ttc 192 
Gly Phe Glu Gly Gly Gin Met Pro Leu Tyr Arg Arg Leu Pro Lys Phe 
50 55 60 

ggc ttc act tct cgt aaa gca gcg att aca gee gaa att cgt ctg tct 240 
Gly Phe Thr Ser Arg Lys Ala Ala lie Thr Ala Glu lie Arg Leu Ser 
65 70 75 80 

gac ctg get aaa gta gaa ggc ggt gta gta gac ctg aac acg ctg aaa 288 
Asp Leu Ala Lys Val Glu Gly Gly Val Val Asp Leu Asn Thr Leu Lys 
85 90 95 

gcg get aac att ate ggt ate cag ate gag ttc gcg aaa gtg ate ctg 336 
Ala Ala Asn lie He Gly He Gin He Glu Phe Ala Lys Val He Leu 
100 105 110 

get ggc gaa gta acg act ccg gta act gtt cgt ggc ctg cgt gtt act 384 
Ala Gly Glu Val Thr Thr Pro Val Thr Val Arg Gly Leu Arg Val Thr 
115 120 125 

aaa ggc get cgt get get ate gaa get get ggc ggt aaa ate gag gaa 432 
Lys Gly Ala Arg Ala Ala He Glu Ala Ala Gly Gly Lys He Glu Glu 
130 135 140 

taa 435 
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* 



<210> 97 
<211> 180 
<212> DNA 

<2X3> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (180) 

<400> 97 

atg gca aag act att aaa att act caa acc cgc agt gca ate ggt cgt 
Met Ala Lys Thr lie Lys lie Thr Gin Thr Arg Ser Ala lie Gly Arg 
15 10 15 



48 



ctg ccg aaa cac aag gca acg ctg ctt ggc ctg ggt ctg cgt cgt att 96 
Leu Pro Lys His Lys Ala Thr Leu Leu Gly Leu Gly Leu Arg Arg He 
20 25 30 

ggt cac acc gta gag cgc gag gat act cct get att cgc ggt atg ate 144 
Gly His Thr Val Glu Arg Glu Asp Thr Pro Ala He Arg Gly Met He 
35 40 45 

aac gcg gtt tec ttc atg gtt aaa gtt gag gag taa 180 
Asn Ala Val Ser Phe Met Val Lys Val Glu Glu * 
50 55 



<210> 98 
<211> 504 

<212> DNA ! 
<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (504) 

<400> 98 

atg get cac ate gaa aaa caa get ggc gaa ctg cag gaa aag ctg ate 48 
Met Ala His He Glu Lys Gin Ala Gly Glu Leu Gin Glu Lys Leu He 
15 10 15 

gcg gta aac cgc gta tct aaa acc gtt aaa ggt ggt cgt att ttc tec 96 
Ala Val Asn Arg Val Ser Lys Thr Val Lys Gly Gly Arg He Phe Ser 
20 25 30 

ttc aca get ctg act gta gtt ggc gat ggt aac ggt cgc gtt ggt ttt 144 
Phe Thr Ala Leu Thr Val Val Gly Asp Gly Asn Gly Arg Val Gly Phe 
35 40 45 

ggt tac ggt aaa gcg cgt gaa gtt cca gca gcg ate cag aaa gcg atg 192 
Gly Tyr Gly Lys Ala Arg Glu Val Pro Ala Ala He Gin Lys Ala Met 
50 55 60 

gaa aaa gee cgt cgc aat atg att aac gtc gcg ctg aat aac ggc act 240 
Glu Lys Ala Arg Arg Asn Met He Asn Val Ala Leu Asn Asn Gly Thr 
65 70 75 80 
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ctg caa cac cct gtt aaa ggt gtt cac a eg ggt tct cgc gta ttc atg 288 
Leu Gin His Pro Val Lys Gly Val His Thr Gly Ser Arg Val Phe Met 
85 90 95 

cag ccg get tec gaa ggt ace ggt ate ate gee ggt ggt gca atg cgc 336 
Gin Pro Ala Ser Glu Gly Thr Gly lie lie Ala Gly Gly Ala Met Arg 
100 105 110 

gee gtt ctg gaa gtc get ggg gtt cat aac gtt ctg get aaa gec tat 384 
Ala Val Leu Glu Val Ala Gly Val His Asn Val Leu Ala Lys Ala Tyr 
115 120 125 

ggt tec ace aac ccg ate aac gtg gtt cgt gca act att gat ggc ctg 432 
Gly Ser Thr Asn Pro lie Asn Val Val Arg Ala Thr lie Asp Gly Leu 
130 135 140 

gaa aat atg aat tct cca gaa atg gtc get gee aag cgt ggt aaa tec 480 
Glu Asn Met Asn Ser Pro Glu Met Val Ala Ala Lys Arg Gly Lys Ser 
145 150 155 160 

gtt gaa gaa att ctg ggg aaa taa 504 
Val Glu Glu He Leu Gly Lys * 
165 



<210> 99 
<211> 354 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (354) 

<400> 99 

atg gat aag aaa tct get cgt ate cgt cgt gcg ace cgc gca cgc cgc 48 
Met Asp Lys Lys Ser Ala Arg He Arg Arg Ala Thr Arg Ala Arg Arg 
15 10 15 

aag etc cag gag ctg ggc gca act cgc ctg gtg gta cat cgt ace ccg 96 
Lys Leu Gin Glu Leu Gly Ala Thr Arg Leu Val Val His Arg Thr Pro 
20 25 30 

cgt cac att tac gca cag gta att gca ccg aac ggt tct gaa gtt ctg 144 
Arg His He Tyr Ala Gin Val He Ala Pro Asn Gly Ser Glu Val Leu 
35 40 45 

gta get get tct act gta gaa aaa get ate get gaa caa ctg aag tac 192 
Val Ala Ala Ser Thr Val Glu Lys Ala He Ala Glu Gin Leu Lys Tyr 
50 55 60 

ace ggt aac aaa gac gcg get gca get gtg ggt aaa get gtc get gaa 240 
Thr Gly Asn Lys Asp Ala Ala Ala Ala Val Gly Lys Ala Val Ala Glu 
65 70 75 80 

cgc get ctg gaa aaa ggc ate aaa gat gta tec ttt gac cgt tec ggg 288 
Arg Ala Leu Glu Lys Gly He Lys Asp Val Ser Phe Asp Arg Ser Gly 
85 90 95 

ttc caa tat cat ggt cgt gtc cag gca ctg gca gat get gee cgt gaa 336 
Phe Gin Tyr His Gly Arg Val Gin Ala Leu Ala Asp Ala Ala Arg Glu 
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100 



105 



110 



get ggc ctt cag ttc taa 
Ala Gly Leu Gin Phe * 
115 



354 



<210> 100 
<211> 534 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (534) 

<400> 100 

atg tct cgt gtt get aaa gca ccg gtc gtt gtt cct gec ggc gtt gac 48 

Met Ser Arg Val Ala Lys Ala Pro Val Val Val Pro Ala Gly Val Asp 
15 10 15 

gta aaa ate aac ggt cag gtt att acg ate aaa ggt aaa aac ggc gag 96 
Val Lys lie Asn Gly Gin Val lie Thr lie Lys Gly Lys Asn Gly Glu 
20 25 30 

ctg act cgt act etc aac gat get gtt gaa gtt aaa cat gca gat aat 144 
Leu Thr Arg Thr Leu Asn Asp Ala Val Glu Val Lys His Ala Asp Asn 
35 40 45 

ace ctg acc ttc ggt ccg cgt gat ggt tac gca gac ggt tgg gca cag 192 
Thr Leu Thr Phe Gly Pro Arg Asp Gly Tyr Ala Asp Gly Trp Ala Gin 
50 55 60 

get ggt acc gcg cgt gec ctg ctg aac tea atg gtt ate ggt gtt acc 240 
Ala Gly Thr Ala Arg Ala Leu Leu Asn Ser Met Val He Gly Val Thr 
65 70 75 80 

gaa ggc ttc act aag aag ctg cag ctg gtt ggt gta ggt tac cgt gca 288 
Glu Gly Phe Thr Lys Lys Leu Gin Leu Val Gly Val Gly Tyr Arg Ala 
85 90 95 

gcg gtt aaa ggc aat gtg att aac ctg tct ctg ggt ttc tct cat cct 336 
Ala Val Lys Gly Asn Val He Asn Leu Ser Leu Gly Phe Ser His Pro 
100 105 110 

gtt gac cat cag ctg cct gcg ggt ate act get gaa tgt ccg act cag 384 
Val Asp His Gin Leu Pro Ala Gly He Thr Ala Glu Cys Pro Thr Gin 
115 120 125 

act gaa ate gtg ctg aaa ggc get gat aag cag gtg ate ggc cag gtt 432 
Thr Glu He Val Leu Lys Gly Ala Asp Lys Gin Val He Gly Gin Val 
130 135 140 

gca gcg gat ctg cgc gee tac cgt cgt cct gag cct tat aaa ggc aag 480 
Ala Ala Asp Leu Arg Ala Tyr Arg Arg Pro Glu Pro Tyr Lys Gly Lys 
145 150 155 160 

ggt gtt cgt tac gee gac gaa gtc gtg cgt acc aaa gag get aag aag 528 
Gly Val Arg Tyr Ala Asp Glu Val Val Arg Thr Lys Glu Ala Lys Lys 
165 170 175 
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aag taa 534 
Lys * 



<210> 101 
<211> 393 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (393) 

<400> 101 

atg age atg caa gat ccg ate gcg gat atg ctg acc cgt ate cgt aac 48 
Met Ser Met Gin Asp Pro He Ala Asp Met Leu Thr Arg He Arg Asn 
15 10 15 

ggt cag gec gcg aac aaa get gcg gtc acc atg cct tec tec aag ctg 96 
Gly Gin Ala Ala Asn Lys Ala Ala Val Thr Met Pro Ser Ser Lys Leu 
20 25 30 

aaa gtg gca ate gee aac gtg ctg aag gaa gaa ggt ttt att gaa gat 144 
Lys Val Ala He Ala Asn Val Leu Lys Glu Glu Gly Phe He Glu Asp 
35 40 45 

ttt aaa gtt gaa ggc gac acc aag cct gaa ctg gaa ctt act ctg aag 192 
Phe Lys Val Glu Gly Asp Thr Lys Pro Glu Leu Glu Leu Thr Leu Lys 
50 55 60 

tat ttc cag ggc aaa get gtt gta gaa age att cag cgt gtc age cgc 240 
Tyr Phe Gin Gly Lys Ala Val Val Glu Ser He Gin Arg Val Ser Arg 
65 70 75 80 

cca ggt ctg cgc ate tat aaa cgt aaa gat gag ctg ccg aaa gtt atg 288 
Pro Gly Leu Arg He Tyr Lys Arg Lys Asp Glu Leu Pro Lys Val Met 
85 90 95 

gcg ggt ctg ggt ate gca gtt gtt tct acc tct aaa ggt gtt atg act 336 
Ala Gly Leu Gly He Ala Val Val Ser Thr Ser Lys Gly Val Met Thr 
100 105 110 

gat cgt gca gcg cgc cag get ggt ctt ggt ggc gaa att ate tgc tac 384 
Asp Arg Ala Ala Arg Gin Ala Gly Leu Gly Gly Glu He He Cys Tyr 
115 120 125 

gta gec taa 393 
Val Ala * 
130 



<210> 102 
<211> 306 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(306) 
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<400> 102 

atg get aag caa tea atg aaa gca cgc gaa gta aaa cgc gta get tta 48 
Met Ala Lys Gin Ser Met Lys Ala Arg Glu Val Lys Arg Val Ala Leu 
15 10 15 

get gat aaa tac ttc gcg aaa cgc get gaa ctg aaa gcg ate ate tct 96 
Ala Asp Lys Tyr Phe Ala Lys Arg Ala Glu Leu Lys Ala lie lie Ser 
20 25 30 

gat gtg aac get tec gac gaa gat cgt tgg aac get gtt etc aag ctg 144 
Asp Val Asn Ala Ser Asp Glu Asp Arg Trp Asn Ala Val Leu Lys Leu 
35 40 45 

cag act ctg ccg cgt gat tec age ccg tct cgt cag cgt aac cgc tgc 192 
Gin Thr Leu Pro Arg Asp Ser Ser Pro Ser Arg Gin Arg Asn Arg Cys 
50 55 60 

cgt caa aca ggt cgt ccg cat ggt ttc ctg egg aag ttc ggg ttg age 240 
Arg Gin Thr Gly Arg Pro His Gly Phe Leu Arg Lys Phe Gly Leu Ser 
65 70 75 80 

cgt att aag gtc cgt gaa gee get atg cgc ggt gaa ate ccg ggt ctg 288 
Arg lie Lys Val Arg Glu Ala Ala Met Arg Gly Glu lie Pro Gly Leu 
85 90 95 

aaa aag get age tgg taa 306 
Lys Lys Ala Ser Trp * 
100 



<210> 103 
<211> 540 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(540) 

<400> 103 

atg gcg aaa ctg cat gat tac tac aaa gac gaa gta gtt aaa aaa etc 48 
Met Ala Lys Leu His Asp Tyr Tyr Lys Asp Glu Val Val Lys Lys Leu 
15 10 15 

atg act gag ttt aac tac aat' tct gtc atg caa gtc cct egg gtc gag 96 
Met Thr Glu Phe Asn Tyr Asn Ser Val Met Gin Val Pro Arg Val Glu 
20 25 30 

aag ate ace ctg aac atg ggt gtt ggt gaa gcg ate get gac aaa aaa 144 
Lys lie Thr Leu Asn Met Gly Val Gly Glu Ala lie Ala Asp Lys Lys 
35 40 45 

ctg ctg gat aac gca gca gca gac ctg gca gca ate tec ggt caa aaa 192 
Leu Leu Asp Asn Ala Ala Ala Asp Leu Ala Ala lie Ser Gly Gin Lys 
50 55 60 

ccg ctg ate acc aaa gca cgc aaa tct gtt gca ggc ttc aaa ate cgt 240 
Pro Leu He Thr Lys Ala Arg Lys Ser Val Ala Gly Phe Lys He Arg 
65 70 75 80 

cag ggc tat ccg ate ggc tgt aaa gta act ctg cgt ggc gaa cgc atg 288 
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Gin Gly Tyr Pro lie Gly Cys Lys Val Thr Leu Arg Gly Glu Arg Met 
85 90 95 

tgg 3*3 ttc ttfc 3*3 ctg atc ac t att get gta cct cgt ate cgt 336 

Trp Glu Phe Phe Glu Arg Leu lie Thr lie Ala Val Pro Arg lie Arg 
100 105 110 

gac ttc cgt ggc ctg tec get aag tct ttc gac ggt cgt ggt aac tac 384 
Asp Phe Arg Gly Leu Ser Ala Lys Ser Phe Asp Gly Arg Gly Asn Tyr 
115 120 125 

age atg ggt gtc cgt gag cag atc atc ttc cca gaa atc gac tac gat 432 
Ser Met Gly Val Arg Glu Gin lie lie Phe Pro Glu lie Asp Tyr Asp 
130 135 140 

aaa gtc gac cgc gtt cgt ggt ttg gat att acc att ace act act gcg 480 
Lys Val Asp Arg Val Arg Gly Leu Asp lie Thr lie Thr Thr Thr Ala 
145 150 155 160 

aaa tct gac gaa gaa ggc cgc get ctg ctg get gee ttt gac ttc ccg 528 
Lys Ser Asp Glu Glu Gly Arg Ala Leu Leu Ala Ala Phe Asp Phe Pro 
165 170 " 175 

ttc cgc aag taa 540 
Phe Arg Lys * 



<210> 104 
<211> 315 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (315) 

<400> 104 

atg gca gcg aaa atc cgt cgt gat gac gaa gtt atc gtg tta acc ggt 48 
Met Ala Ala Lys lie Arg Arg Asp Asp Glu Val lie Val Leu Thr Gly 
1 5 ~ 10 15 

aaa gat aaa ggt aaa cgc ggt aaa gtt aag aat gtc ctg tct tec ggc 96 
Lys Asp Lys Gly Lys Arg Gly Lys Val Lys Asn Val Leu Ser Ser Gly 
20 25 30 

aag gtc att gtt gaa ggt atc aac ctg gtt aag aaa cat cag aag ccg 144 
Lys Val lie Val Glu Gly lie Asn Leu Val Lys Lys His Gin Lys Pro 
35 40 45 

gtt ccg gee ctg aac caa ccg ggt ggc atc gtt gaa aaa gaa gec get 192 
Val Pro Ala Leu Asn Gin Pro Gly Gly lie Val Glu Lys Glu Ala Ala 
50 55 60 

att cag gtt tec aac gta gca atc ttc aat gcg gca acc ggc aag get 240 
He Gin Val Ser Asn Val Ala He Phe Asn Ala Ala Thr Gly Lys Ala 
65 70 75 80 

gac cgt gta ggc ttt aga ttc gaa gac ggt aaa aaa gtc cgt ttc ttc 288 
Asp Arg Val Gly Phe Arg Phe Glu Asp Gly Lys Lys Val Arg Phe Phe 
85 90 95 
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aag tct aac age gaa act ate aag taa 315 
Lys Ser Asn Ser Glu Thr He Lys * 
10O 



<210> 105 
<211> 372 
<212> DKA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (372) 

<400> 105 

atg ate caa gaa cag act atg ctg aac gtc gec gac aac tec ggt gca 48 
Met He Gin Glu Gin Thr Met Leu Asn Val Ala Asp Asn Ser Gly Ala 
15 10 15 

cgt cgc gta atg tgt ate aag gtt ctg ggt ggc teg cac cgt cgc tac 96 
Arg Arg Val Met Cys He Lys Val Leu Gly Gly Ser His Arg Arg Tyr 
20 25 30 

gca ggc gta ggc gac ate ate aag ate ace ate aaa gaa gca att ccg 144 
Ala Gly Val Gly Asp He He Lys He Thr He Lys Glu Ala He Pro 
35 40 45 

cgt ggt aag gtc aaa aaa ggt gat gtg ctg aag gcg gta gtg gtg cgc 192 
Arg Gly Lys Val Lys Lys Gly Asp Val Leu Lys Ala Val Val Val Arg 
50 55 60 

ace aag aag ggt gtt cgt cgc ccg gac ggt tct gtc att cgc ttc gat 240 
Thr Lys Lys Gly Val Arg Arg Pro Asp Gly Ser Val He Arg Phe Asp 
65 70 75 80 

ggt aat get tgt gtt ctt ctg aac aac aac age gag cag cct ate ggt 288 
Gly Asn Ala Cys Val Leu Leu Asn Asn Asn Ser Glu Gin Pro He Gly 
85 90 95 

acg cgt att ttt ggg ccg gta act cgt gag ctt cgt agt gag aag ttc 336 
Thr Arg He Phe Gly Pro Val Thr Arg Glu Leu Arg Ser Glu Lys Phe 
100 105 110 

atg aaa att ate tct ctg gca cca gaa gta etc taa 372 
Met Lys He He Ser Leu Ala Pro Glu Val Leu * 
115 120 



<210> 106 
<211> 1458 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1458) 

<400> 106 

gtg gga att tat ttt act aat tea gat gat caa att tac ttt aaa agg 48 
Met Gly He Tyr Phe Thr Asn Ser Asp Asp Gin He Tyr Phe Lys Arg 
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15 10 15 

agt gaa ggg atg tct gat ata aac cat gca ggt tct gac ctt ata ttt 96 
Ser Glu Gly Met Ser Asp lie Asn His Ala Gly Ser Asp Leu He Phe 
20 25 30 

gaa ctg gag gat cgc cct ccc ttt cat cag get etc gtt ggt gec att 144 
Glu Leu Glu Asp Arg Pro Pro Phe His Gin Ala Leu Val Gly Ala He 
35 40 45 

acc cat ctg ttg gca att ttc gtt ccg atg gta acc ccc gcg tta ate 192 
Thr His Leu Leu Ala Xle Phe Val Pro Met Val Thr Pro Ala Leu He 
50 55 60 

gtg ggt gcg gec tta cag ctt tec get gaa aca act gee tat ctt gtt 240 
Val Gly Ala Ala Leu Gin Leu Ser Ala Glu Thr Thr Ala Tyr Leu Val 
65 70 75 80 

tct atg gcg atg ate gec tct ggt att ggt acc tgg tta caa gta aac 288 
Ser Met Ala Met He Ala Ser Gly He Gly Thr Trp Leu Gin Val Asn 
85 90 95 

cgc tac ggc ate gtc ggt tct ggc eta etc tea att cag tea gtc aat 336 
Arg Tyr Gly He Val Gly Ser Gly Leu Leu Ser He Gin Ser Val Asn 
100 105 110 

ttt tea ttt gtt acg gtc atg att gcg ctg ggc age age atg aaa age 384 
Phe Ser Phe Val Thr Val Met He Ala Leu Gly Ser Ser Met Lys Ser 
115 120 125 

gac ggt ttt cac gaa gag tta ate atg teg teg ctt etc ggc gtc tec 432 
Asp Gly Phe His Glu Glu Leu He Met Ser Ser Leu Leu Gly Val Ser 
130 135 140 

ttc gtt ggc gca ttt ctg gtt gtc gga tct tea ttt ate ttg ccc tat 480 
Phe Val Gly Ala Phe Leu Val Val Gly Ser Ser Phe He Leu Pro Tyr 
145 150 155 160 

tta cgt egg gtt att acg cct acc gtc age ggt att gtg gta ctg atg 52 8 
Leu Arg Arg Val He Thr Pro Thr Val Ser Gly He Val Val Leu Met 
165 170 175 

ate ggc tta age ctg att aaa gtc ggc att ate gat ttt ggt gga gga 576 
He Gly Leu Ser lieu He Lys Val Gly He He Asp Phe Gly Gly Gly 
180 185 190 

ttt gca gee aaa age age ggt acg ttc ggc aat tac gaa cat etc ggc 624 
Phe Ala Ala Lys Ser Ser Gly Thr Phe Gly Asn Tyr Glu His Leu Gly 
195 200 205 

gtt ggt tta ttg gtt tta att gtg gtg ate ggc ttt aac tgc tgt cgc 672 
Val Gly Leu Leu Val Leu He Val Val He Gly Phe Asn Cys Cys Arg 
210 215 220 

agt ccg ttg eta cgc atg gga ggg ate gee att ggg eta tgt gtc ggc 720 
Ser Pro Leu Leu Arg Met Gly Gly He Ala He Gly Leu Cys Val Gly 
225 230 235 240 

tat ate gca teg tta tgc ctg ggc atg gtg gat ttc age agt atg cgc 768 
Tyr He Ala Ser Leu Cys Leu Gly Met Val Asp Phe Ser Ser Met Arg 
245 250 255 
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aat ttg ccg tta ate acc ate ccg cat ccg ttc aaa tac ggc ttt agt 816 

Asn Leu Pro Leu lie Thr lie Pro His Pro Phe Lys Tyr Gly Phe Ser 
260 265 270 

* 

ttt age ttc cat cag ttc ctg gtg gtt ggc acg att tat ctg ctt age 864 

Phe Ser Phe Hie Gin Phe Leu Val Val Gly Thr lie Tyr Leu Leu Ser 

275 280 285 

gtg ctg gaa gca gtc ggc gat ate acc gee acg gca atg gtt tec cgc 912 
Val Leu Glu Ala Val Gly Asp lie Thr Ala Thr Ala Met Val Ser Arg 
290 295 300 

cgc ccc att cag ggg gaa gag tat cag tec egg ctg aaa ggc ggc gtg 960 
Arg Pro lie Gin Gly Glu Glu Tyr Gin Ser Arg Leu Lys Gly Gly Val 
305 310 315 320 

ctg gca gat ggt ctg gtt tct gtt ate gee tec get gtc ggt tea tta 1008 
Leu Ala Asp Gly Leu Val Ser Val lie Ala Ser Ala Val Gly Ser Leu 
325 330 335 

cca tta acc acg ttt gcg caa aat aat ggg gtt att cag atg act ggc 1056 
Pro Leu Thr Thr Phe Ala Gin Asn Asn Gly Val lie Gin Met Thr Gly 
340 345 350 

gtc get tea cgt tat gtc ggg cga acc ate gcg gta atg ctg gtt ate 1104 
Val Ala Ser Arg Tyr Val Gly Arg Thr lie Ala Val Met Leu Val lie 
355 360 365 

etc ggc tta ttt ccg atg att ggc ggc ttc ttc acg acc att ccc teg 1152 
Leu Gly Leu Phe Pro Met lie Gly Gly Phe Phe Thr Thr lie Pro Ser 
370 375 380 

gca gtt ctg gga ggc gca atg acg ttg atg ttt tec atg att gee ate 1200 
Ala Val Leu Gly Gly Ala Met Thr Leu Met Phe Ser Met lie Ala lie 
385 390 395 400 

gca ggg att cgc ate ate ate acc aac ggt tta aag cgc cgt gaa aca 1248 
Ala Gly He Arg He He He Thr Asn Gly Leu Lys Arg Arg Glu Thr 
405 410 415 

ctt att gtc gee act tct tta ggt tta ggg ctt ggc gtc tec tac gat 1296 
Leu He Val Ala Thr Ser Leu Gly Leu Gly Leu Gly Val Ser Tyr Asp 
420 425 430 

ccc gaa att ttt aaa ata ttg cca gee tct att tat gta tta gtt gaa 1344 
Pro Glu He Phe Lys He Leu Pro Ala Ser He Tyr Val Leu Val Glu 
435 440 445 

aac cct att tgt get ggc ggg tta act gcg att tta tta aat att ate 1392 
Asn Pro He Cys Ala Gly Gly Leu Thr Ala He Leu Leu Asn He He 
450 455 460 

etc cct ggt ggc tac cga cag gaa aac gtt ctg cct ggt att acc tea 1440 
Leu Pro Gly Gly Tyr Arg Gin Glu Asn Val Leu Pro Gly He Thr Ser 
465 470 475 480 

gcg gaa gag atg gat taa 1458 
Ala Glu Glu Met Asp * 
485 
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<210> 107 
<211> 1320 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1320) 

<400> 107 

atg atg tea gga gaa cac acg tta aaa gcg gta cga ggc agt ttt att 48 
Met Met Ser Gly Glu His Thr Leu Lys Ala Val Arg Gly Ser Phe lie 
15 10 15 

gat gtc acc cgt acg ate gat aac ccg gaa gag att gec tct gcg ctg 96 
Asp .Val Thr Arg Thr lie Asp Asn Pro Glu Glu lie Ala Ser Ala Leu 
20 -25 30 

egg ttt att gag gat ggt tta tta etc att aaa cag gga aaa gtg gaa 144 
Arg Phe lie Glu Asp Gly Leu Leu Leu He Lys Gin Gly Lys Val Glu 
35 40 45 

tgg ttt ggc gaa tgg gaa aac gga aag cat caa att cct gac acc att 192 
Trp Phe Gly Glu Trp Glu Asn Gly Lys His Gin He Pro Asp Thr He 
50 55 60 

cgc gtg cgc gac tat cgc ggc aaa ctg ata gta ccg ggc ttt gtc gat 240 
Arg Val Arg Asp Tyr Arg Gly Lys Leu He Val Pro Gly Phe Val Asp 
65 70 75 80 

aca cat ate cat tat ccg caa agt gaa atg gtg ggg gee tat ggt gag 288 
Thr His He His Tyr Pro Gin Ser Glu Met Val Gly Ala Tyr Gly Glu 
85 90 95 

caa ttg ctg gag tgg ttg aat aaa cac acc ttc cct act gaa cgt cgt 336 
Gin Leu Leu Glu Trp Leu Asn Lys His Thr Phe Pro Thr Glu Arg Arg 
100 105 110 

tat gag gat tta gag tac gec cgc gaa atg teg gcg ttc ttc ate aag 384 
Tyr Glu Asp Leu Glu Tyr Ala Arg Glu Met Ser Ala Phe Phe He Lys 
115 120 125 

cag ctt tta cgt aac gga acc acc acg gcg ctg gtg ttt ggc act gtt 432 
Gin Leu Leu Arg Asn Gly. Thr Thr Thr Ala Lieu Val Phe Gly Thr Val 
130 135 140 

cat ccg caa tct gtt gat gcg ctg ttt gaa gee gee agt cat ate aat 480 
His Pro Gin Ser Val Asp Ala Leu Phe Glu Ala Ala Ser His lie Asn 
145 150 155 160 

atg cgt atg att gec ggt aag gtg atg atg gac cgc aac gca ccg gat 528 
Met Arg Met He Ala Gly Lys Val Met Met Asp Arg Asn Ala Pro Asp 
165 170 175 

tat ctg etc gac act gee gaa age age tat cac caa age aaa gaa ctg 576 
Tyr Leu Leu Asp Thr Ala Glu Ser Ser Tyr His Gin Ser Lys Glu Leu 
180 185 190 

ate gaa cgc tgg cac aaa aat ggt cgt ctg eta tat gcg att acg cca 624 
He Glu Arg Trp His Lys Asn Gly Arg Leu Leu Tyr Ala He Thr Pro 
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195 200 205 

cgc ttc gcc ccg acc tea tct cct gaa cag atg gcg atg gcg caa cgc 672 
Arg Phe Ala Pro Thr Ser Ser Pro Glu Gin Met Ala Met Ala Gin Arg 
210 215 220 

ctg aaa gaa gaa tat ccg gat acg tgg gta cat acc cat etc tgt gaa 720 
Leu Lys Glu Glu Tyr Pro Asp Thr Trp Val His Thr His Leu Cys Glu 
225 230 235 240 

aac aaa gat gaa att gcc tgg gtg aaa teg ctt tat cct gac cat gat 768 
Asn Lys Asp Glu He Ala Trp Val Lys Ser Leu Tyr Pro Asp His Asp 
245 250 255 

ggt tat ctg gat gtt tac cat cag tac ggc ctg acc ggt aaa aac tgt 816 
Gly Tyr Leu Asp Val Tyr His Gin Tyr Gly Leu Thr Gly Lys Asn Cys 
260 265 270 

gtc ttt get cac tgc gtc cat etc gaa gaa aaa gag tgg gat cgt etc 864 
Val Phe Ala His Cys Val His Leu Glu Glu Lys Glu Trp Asp Arg Leu 
275 280 285 

age gaa acc aaa tec age att get ttc tgt ccg acc tec aac ctt tac 912 
Ser Glu Thr Lys Ser Ser He Ala Phe Cys Pro Thr Ser Asn Leu Tyr 
290 295 300 

etc ggc age ggc tta ttc aac ttg aaa aaa gca tgg cag aag aaa gtt 960 
Leu Gly Ser Gly Leu Phe Asn Leu Lys Lys Ala Trp Gin Lys Lys Val 
305 310 315 320 

aaa gtg ggc atg gga acg gat ate ggt gcc gga acc act ttc aac atg 1008 
Lys Val Gly Met Gly Thr Asp He Gly Ala Gly Thr Thr Phe Asn Met 
325 330 335 

ctg caa acg ctg aac gaa gcc tac aaa gta ttg caa tta caa ggc tat 1056 
Leu Gin Thr Leu Asn Glu Ala Tyr Lys Val Leu Gin Leu Gin Gly Tyr 
340 345 350 

cgc etc teg gca tat gaa gcg ttt tac ctg gcc acg etc ggc gga gcg 1104 
Arg Leu Ser Ala Tyr Glu Ala Phe Tyr Leu Ala Thr Leu Gly Gly Ala 
355 360 365 

aaa tct ctg ggc ctt gac gat ttg att ggc aac ttt tta cct ggc aaa 1152 
Lys Ser Leu Gly Leu Asp Asp Leu lie Gly Asn Phe Leu Pro Gly Lys 
370 375 380 

gag get gat ttc gtg gtg atg gaa ccc acc gcc act ccg eta cag cag 1200 
Glu Ala Asp Phe Val Val Met Glu Pro Thr Ala Thr Pro Leu Gin Gin 
385 390 395 400 

ctg cgc tat gac aac tct gtt tct tta gtc gac aaa ttg ttc gtg atg 1248 
Leu Arg Tyr Asp Asn Ser Val Ser Leu Val Asp Lys Leu Phe Val Met 
405 410 415 

atg acg ttg ggc gat gac cgt teg ate tac cgc acc tac gtt gat ggt 1296 
Met Thr Leu Gly Asp Asp Arg Ser He Tyr Arg Thr Tyr Val Asp Gly 
420 425 430 

cgt ctg gtg tac gaa cgc aac taa 1320 
Arg Leu Val Tyr Glu Arg Asn * 
435 
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<210> 108 
<211> 570 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (570) 

<400> 108 

atg tct gga gac ate eta caa aca ccg gac gca cca aag cca cag ggc 48 
Met Ser Gly Asp lie Leu Gin Thr Pro Asp Ala Pro Lys Pro Gin Gly 
15 10 15 

gcg ctg gat aat tat ttt aaa att acc get cgt ggc agt acc gtt cgt 96 
Ala Leu Asp Asn Tyr Phe Lys lie Thr Ala Arg Gly Ser Thr Val Arg 
20 25 30 

cag gaa gta ctg get ggc tta acg acc ttt ctg gee atg gtt tat tec 144 
Gin Glu Val Leu Ala Gly Leu Thr Thr Phe Leu Ala Met ValTyr Ser 
35 40 45 

gtt ate gtc gtt ccg gga atg ctg ggc aaa gca ggt ttt cct ccc gca 192 
Val He Val Val Pro Gly Met Leu Gly Lys Ala Gly Phe Pro Pro Ala 
50 55 60 

get gtg ttt gtt gee acc tgt ctg gtc gcg ggc ttc ggc teg ttg ctg 240 
Ala Val Phe Val Ala Thr Cys Leu Val Ala Gly Phe Gly Ser Leu Leu 
65 70 75 80 

at ST 9^ a t fca tgg Sr ct aat ttg cca atg gcg att ggt tgc gcg att tec 288 
Met Gly Leu Trp Ala Asn Leu Pro Met Ala He Gly Cys Ala He Ser 
85 90 95 

ttg acg gcg ttt acc gca ttc agt ctg gta etc ggg caa caa att age 336 
Leu Thr Ala Phe Thr Ala Phe Ser Leu Val Leu Gly Gin Gin He Ser 
100 105 110 

gtt cct gtc gca ctg ggc gcg gta ttt ctg atg ggc gtc ate ttc acc 384 
Val Pro Val Ala Leu Gly Ala Val Phe Leu Met Gly Val He Phe Thr 
115 120 125 

gec att tec gta acc ggt gtg cgt acc tgg ate tta cgt aat ttg ccg 432 
Ala He Ser Val Thr Gly Val Arg Thr Trp He Leu Arg Asn Leu Pro 
130 135 140 

atg ggt ate get cac ggt aca ggt ate ggt ate ggg ctg ttt ctg ctg 480 
Met Gly He Ala His Gly Thr Gly He Gly He Gly Leu Phe Leu Leu 
145 150 155 160 

ctg att get get aac ggt gtg ggt atg gtt ate aaa aac ccg att gaa 528 
Leu He Ala Ala Asn Gly Val Gly Met Val He Lys Asn Pro He Glu 
165 170 175 

ggc ttg cag tgg cgc teg gtg cgt tta cct cct tec egg tga 570 
Gly Leu Gin Trp Arg Ser Val Arg Leu Pro Pro Ser Arg * 
180 185 
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<210> 109 
<211> 831 
<212> DNA 

<213> Escherichia coli 

4 

<220> 

<221> CDS 

<222> (1) . . . (831) 

<400> 109 

gtg gcg etc ggt gcg ttt acc tec ttc ccg gtg atg atg age ttg ctg 48 
Met Ala Leu Gly Ala Phe Thr Ser Phe Pro Val Met Met Ser Leu Leu 
1 5 10 15 

ggg ctg gcg gtc ate ttc ggc ctg gag aag tgt cgc gta ccc ggc ggg 96 
Gly Leu Ala Val lie Phe Gly Leu Glu Lys Cys Arg Val Pro Gly Gly 
20 25 30 

ate ttg ttg gtg att att gca att teg ate ate ggc tta ate ttt gac 144 
lie Leu Leu Val lie lie Ala lie Ser lie lie Gly Leu lie Phe Asp 
35 40 45 

cca gcg gtg aaa tac cac ggt ctg gtg gcg atg cca age ctg act ggc 192 
Pro Ala Val Lys Tyr His Gly Leu Val Ala Met Pro Ser Leu Thr Gly 
50 55 60 

gaa gat ggt aag tct ctg att ttc age etc gat att atg ggt gca etc 240 
Glu Asp Gly Lys Ser Leu lie Phe Ser Leu Asp lie Met Gly Ala Leu 
65 70 75 80 

cag cca act gta ctt ccg agt gta ctg gca ttg gtg atg acc gca gtg 288 
Gin Pro Thr Val Leu Pro Ser Val Leu Ala Leu Val Met Thr Ala Val 
85 90 95 

ttc gac get act ggc acc ate cgt gee gtc gee ggt cag gcg aat ttg 336 
Phe Asp Ala Thr Gly Thr lie Arg Ala Val Ala Gly Gin Ala Asn Leu 
100 105 110 

ttg gat aaa gac aac cag ate ate aac ggc ggc aaa gee ctg acc agt 384 
Leu Asp Lys Asp Asn Gin He He Asn Gly Gly Lys Ala Leu Thr Ser 
115 120 125 

gac tea gta agt tea ata ttc tec ggc ctg gtg ggc gca gcg ccc gca 432 
Asp Ser Val Ser Ser He Phe Ser Gly Leu Val Gly Ala Ala Pro Ala 
130 135 140 

gcg gtt tat ate gaa tea gcg gca gga acc gee gee ggg ggt aaa aca 480 
Ala Val Tyr He Glu Ser Ala Ala Gly Thr Ala Ala Gly Gly Lys Thr 
145 150 155 160 

ggt tta acc gca acc gta gtg ggg gcg tta ttc ctg tta att ctg ttt 528 
Gly Leu Thr Ala Thr Val Val Gly Ala Leu Phe Leu Leu He Leu Phe 
165 170 175 

tta tea ccg ctg tea ttt ttg ate cct ggt tac gee act gca ccc get 576 
Leu Ser Pro Leu Ser Phe Leu He Pro Gly Tyr Ala Thr Ala Pro Ala 
180 185 190 

ctg atg tac gta ggt ttg ctg atg tta agt aac gtc teg aag ctg gat 624 
Leu Met Tyr Val Gly Leu Leu Met Leu Ser Asn Val Ser Lys Leu Asp 
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X95 200 205 

ttc aat gat ttt att gac get atg get ggc ctg gtg tgt gee gtg ttc 672 
Phe Asn Asp Phe lie Asp Ala Met Ala Gly Leu Val Cys Ala Val Phe 
210 215 220 

ate gtt ctg act tgt aat ate gtt ace ggt att atg ctg ggc ttt gtg 720 
He Val Leu Thr Cys Asn lie Val Thr Gly He Met Leu Gly Phe Val 
225 230 235 240 

aca ctg gtc gta ggc cgc gtc ttt gca cgc gaa tgg caa aag ctg aat 768 
Thr Leu Val Val Gly Arg Val Phe Ala Arg Glu Trp Gin Lys Leu Asn 
245 250 255 

att ggt acg gtg ate att act gee gca ctg gtc gca ttt tac gcg ggt 816 
He Gly Thr Val He He Thr Ala Ala Leu Val Ala Phe Tyr Ala Gly 
260 265 270 

ggt tgg gca ate taa 831 
Gly Trp Ala He * 
275 



<210> 110 
<211> 1401 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . , . (1401) 

<400> 110 

atg aat age gaa ggg ggg aaa ccg ggg aat gta ctg ace gtt aac ggc 48 
Met Asn Ser Glu Gly Gly Lys Pro Gly Asn Val Leu Thr Val Asn Gly 
1 5 10 15 

aac tat ace gga aac aat ggc ctg atg acg ttc aac gcg acg ctg ggc 96 
Asn Tyr Thr Gly Asn Asn Gly Leu Met Thr Phe Asn Ala Thr Leu Gly 
20 25 30 

ggc gat aat teg ccc acc gat aag atg aac gtg aaa ggc gat ace caa 144 
Gly Asp Asn Ser Pro Thr Asp Lys Met Asn Val Lys Gly Asp Thr Gin 
35 40 45 

ggg aac act cgc gtt egg gtt gat aac att ggc ggc gtc ggt gca caa 192 
Gly Asn Thr Arg Val Arg Val Asp Asn He Gly Gly Val Gly Ala Gin 
50 55 60 

acg gtc aac ggt att gaa etc att gag gtt ggc ggt aat tct gca ggt 240 
Thr Val Asn Gly He Glu Leu He Glu Val Gly Gly Asn Ser Ala Gly 
65 70 75 80 

aac ttc gcg ctg acc acc gga act gtc gaa get ggg get tac gtc tac 288 
Asn Phe Ala Leu Thr Thr Gly Thr Val Glu Ala Gly Ala Tyr Val Tyr 
85 90 95 

acg ctg get aaa ggg aag ggg aat gac gag aaa aac tgg tat ctg acc 336 
Thr Leu Ala Lys Gly Lys Gly Asn Asp Glu Lys Asn Trp Tyr lieu Thr 
100 105 110 
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agt aaa tgg gac ggc gta acg cca gcg gat aca ccc gat ccc ate aat 384 
Ser Lys Trp Asp Gly Val Thr Pro Ala Asp Thr Pro Asp Pro lie Asn 
115 120 125 

aat ccc cct gtt gtg gat ccg gaa ggc cca tea gtt tat cgc ccg gag 432 
Asn Pro Pro Val Val Asp Pro Glu Gly Pro Ser Val Tyr Arg Pro Glu 
130 135 140 

gec gga age tat ate age aac att gee gca gee aac teg ctg ttt age 480 
Ala Gly Ser Tyr lie Ser Asn lie Ala Ala Ala Asn Ser Leu Phe Ser 
145 150 155 160 

cat cgt tta cac gac cgt ctg ggt gag ccg cag tat aca gat tea ctg 528 
His Arg Leu His Asp Arg Leu Gly Glu Pro Gin Tyr Thr Asp Ser Leu 
165 ~ 170 ~ 175 

cat tct cag ggg teg gca age agt atg tgg atg cgt cat gtc gga ggg 576 
His Ser Gin Gly Ser Ala Ser Ser Met Trp Met Arg His Val Gly Gly 
180 185 190 

cac gaa cgt tea agg gee ggt gac ggt cag eta aat act cag get aac 624 
His Glu Arg Ser Arg Ala Gly Asp Gly Gin Leu Asn Thr Gin Ala Asn 
195 ^ 200 * 205 

cgc tat gta ttg cag eta ggc ggc gat ttg gcg cag tgg agt age aac 672 
Arg Tyr Val Leu Gin Leu Gly Gly Asp Leu Ala Gin Trp Ser Ser Asn 
210 215 220 

i 

gcg cag gat cgc tgg cat ctt ggc gtg atg gca ggc tac gee aat cag 720 
Ala Gin Asp Arg Trp His Leu Gly Val Met Ala Gly Tyr Ala Asn Gin 
225 230 235 240 

cac agt aat act cag agt aat' cgt gtg ggt tat aaa teg gat ggg cgc 768 
His Ser Asn Thr Gin Ser Asn Arg Val Gly Tyr Lys Ser Asp Gly Arg 
245 250 255 

ate age ggt tac age get ggg ctg tac gcg ace tgg tat cag aac gat 816 
lie Ser Gly Tyr Ser Ala Gly Leu Tyr Ala Thr Trp Tyr Gin Asn Asp 
260 265 270 

gcg aat aag acc ggc get tat gtt gac age tgg gcg ctg tat aac tgg 864 
Ala Asn Lys Thr Gly Ala Tyr Val Asp Ser Trp Ala Leu Tyr Asn Trp 
275 280 285 

ttt gat aac age gtc agt tec gat aac cgt tct get gac gac tat gat 912 
Phe Asp Asn Ser Val Ser Ser Asp Asn Arg Ser Ala Asp Asp Tyr Asp 
290 295 300 

tct cgc ggt gtg acg gec tct gtt gag ggt ggg tat acc ttt gaa gcg 960 
Ser Arg Gly Val Thr Ala Ser Val Glu Gly Gly Tyr Thr Phe Glu Ala 
305 310 315 320 

gga aca ttt age ggc age gaa ggg acg ctg aat acc tgg tac gtc cag 1008 
Gly Thr Phe Ser Gly Ser Glu Gly Thr Leu Asn Thr Trp Tyr Val Gin 
325 330 335 

cca cag gcg caa ate acc tgg atg ggt gtg aaa gat tec gac cat acc 1056 
Pro Gin Ala Gin lie Thr Trp Met Gly Val Lys Asp Ser Asp His Thr 
340 345 350 

egg aaa gac gga acg cgc att gaa acg gaa ggc gac gga aat gtg caa 1104 
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Arg Lys Asp Gly Thr Arg He Glu Thr Glu Gly Asp Gly Asn Val Gin 
355 360 365 

acg cga ctt ggg gtg aaa acc tac ctg aac age cat cac cag cgt gac 1152 
Thr Arg Leu Gly Val Lys Thr Tyr Leu Asn Ser His His Gin Arg Asp 
370 375 380 

gat ggt aaa cag cgt gag ttc cag cct tac att gaa gcg aac tgg ate 1200 
Asp Gly Lys Gin Arg Glu Phe Gin Pro Tyr He Glu Ala Asn Trp He 
385 390 395 400 

aac aat age aaa gtc tac gec gtg aag atg aat ggt caa acc gta ggc 1248 
Asn Asn Ser Lys Val Tyr Ala Val Lys Met Asn Gly Gin Thr Val Gly 
405 410 415 

cgt gaa ggt gcg cgt aat etc ggt gaa gta cgt acc ggg gtt gag gcg 1296 
Arg Glu Gly Ala Arg Asn Leu Gly Glu Val Arg Thr Gly Val Glu Ala 
420 425 430 

aaa gta aat aac aac ctt age ctg tgg ggg aat gtc ggt gtg caa eta 1344 
Lys Val Asn Asn Asn Leu Ser Leu Trp Gly Asn Val Gly Val Gin Leu 
435 440 445 

ggt gat aaa ggc tat age gat act cag ggc atg ctg gga gtg aaa tat 1392 
Gly Asp Lys Gly Tyr Ser Asp Thr Gin Gly Met Leu Gly Val Lys Tyr 
450 ~ 455 460 » 

age tgg taa 1401 

Ser Trp * 

465 



<210> 111 
<211> 3978 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (3978) 

<400> 111 

atg aat aga ate tat cgc gtg ata tgg aat tgc act eta cag gta ttt 48 
Met Asn Arg He Tyr Arg Val He Trp Asn Cys Thr Leu Gin Val Phe 
15 10 15 

cag gee tgc teg gaa tta act cgc agg gca ggt aaa aca teg acg gtt 96 
Gin Ala Cys Ser Glu Leu Thr Arg Arg Ala Gly Lys Thr Ser Thr Val 
20 25 30 

aat ttg cgt aaa tec tct gga ctg aca acg aaa ttc agt aga ttg acg 144 
Asn Leu Arg Lys Ser Ser Gly Leu Thr Thr Lys Phe Ser Arg Leu Thr 
35 40 45 

ct SJ ggt gtt ttg ctg gca eta age ggt tea gcg tct ggt gca agt ctg 192 
Leu Gly Val Leu Leu Ala Leu Ser Gly Ser Ala Ser Gly Ala Ser Leu 
50 55 60 

gaa gtt gat aat gat cag att acc aat att gat act gat gtt get tat 240 
Glu Val Asp Asn Asp Gin He Thr Asn He Asp Thr Asp Val Ala Tyr 
65 70 75 80 
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gat gcc tac ctg gtt ggc tgg tat ggc act gga gtg ctt aat att ttg 288 
Asp Ala Tyr Leu Val Gly Trp Tyr Gly Thr Gly Val Leu Asn He Leu 
85 90 95 

get ggc ggt aat gcc tec tta acc act att act acc age gtc att ggc 336 
Ala Gly Gly Asn Ala Ser Leu Thr Thr He Thr Thr Ser Val He Gly 
100 105 110 

get aat gag gac tea gag ggc acc gtt aat gtt ttg ggt ggc acc tgg 384 
Ala Asn Glu Asp Ser Glu Gly Thr Val Asn Val Leu Gly Gly Thr Trp 
115 ~ 120 125 

cga ttg tat gat age gga aat aat gca agg cct tta aat gtg ggt caa 432 
Arg Leu Tyr Asp Ser Gly Asn Asn Ala Arg Pro Leu Asn Val Gly Gin 
130 135 140 

tec gga acg ggg acg ctg aat att aaa cag aag ggt cac gtc gat gga 480 
Ser Gly Thr Gly Thr Leu Asn He Lys Gin Lys Gly His Val Asp Gly 
145 150 155 160 

ggc tat tta aga tta ggt tct teg aca gga ggc gtc ggg acg gtc aat 528 
Gly Tyr Leu Arg Leu Gly Ser Ser Thr Gly Gly Val Gly Thr Val Asn 
165 170 175 

gtt gag gga gag gac tct gtt ctg acg acc gaa tta ttc gaa ata ggt 576 
Val Glu Gly Glu Asp Ser Val Leu Thr Thr Glu Leu Phe Glu He Gly 
180 185 190 

age tat ggc aca ggt tea tta aat att acg gat aag ggt tac gtc acg 624 
Ser Tyr Gly Thr Gly Ser Leu Asn He Thr Asp Lys Gly Tyr Val Thr 
195 200 205 

agt tea ate gtc gcc att tta ggc tat cag gcg ggc agt aat ggc cag 672 
Ser Ser He Val Ala He Leu Gly Tyr Gin Ala Gly Ser Asn Gly Gin 
210 215 220 

gtt gtc gtt gaa aag ggt ggc gag tgg eta ata aaa aat aat gat tec 720 
Val Val Val Glu Lys Gly Gly Glu Trp Leu He Lys Asn Asn Asp Ser 
225 230 A 235 240 

tea att gaa ttt caa att ggt aat caa gga act ggg gag gcg act att 768 
Ser He Glu Phe Gin He Gly Asn Gin Gly Thr Gly Glu Ala Thr He 
245 250 255 

cgc gag ggg ggg ttg gtt acg get gaa aat acg att ate ggt ggc aat 816 
Arg Glu Gly Gly Leu Val Thr Ala Glu Asn Thr He He Gly Gly Asn 
260 265 270 

gcc acc ggt ate gga acc ctg aat gtg cag gat caa gac tct gtc ate 864 
Ala Thr Gly He Gly Thr Leu Asn Val Gin Asp Gin Asp Ser Val He 
275 280 2B5 

acg gta cgc aga etc tat aat gga tat ttc ggt aat ggc aca gtc aat 912 
Thr Val Arg Arg Leu Tyr Asn Gly Tyr Phe Gly Asn Gly Thr Val Asn 
290 295 300 

att tec aat aat gga ctg att aat aac aaa gaa tat tea ttg gtg ggc 960 
He Ser Asn Asn Gly Leu He Asn Asn Lys Glu Tyr Ser Leu Val Gly 
305 310 315 320 
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gtt cag gac ggt tec cac ggt gtc gtc aac gtg acc gat aaa ggg cat 1008 
Val Gin Asp Gly Ser His Gly Val Val Asxi Val Thr Asp Lys Gly His 
325 330 335 

tgg aat ttc etc gga acg ggc gaa get ttc cgc tat ate tat ate ggt 1056 
Trp Asn Phe Leu Gly Thr Gly Glu Ala Phe Arg Tyr He Tyr He Gly 
340 345 350 

gat get ggc gac ggt gaa ctt aat gtc teg agt gaa ggc aaa gta gat 1104 
Asp Ala Gly Asp Gly Glu Leu Asn Val Ser Ser Glu Gly Lys Val Asp 
355 360 365 

teg gga att ate act gcg ggg atg aaa gaa aca ggc aca ggc aac att 1152 
Ser Gly He He Thr Ala Gly Met Lys Glu Thr Gly Thr Gly Asn He 
370 375 380 

act gtt aag gat aag aac tec gtt ate act aat etc gga act aat ctt 1200 
Thr Val Lys Asp Lys Asn Ser Val He Thr Asn Leu Gly Thr Asn Leu 
385 390 395 400 

ggt tat gac ggc cac ggc gaa atg aat ate agt aat cag ggg ctt gtt 1248 
Gly Tyr Asp Gly His Gly Glu Met Asn He Ser Asn Gin Gly Leu Val 
405 410 415 

gtc age aac gga gga agt tea etc ggt tat gga gaa acc ggc gtc ggg 1296 
Val Ser Asn Gly Gly Ser Ser Leu Gly Tyr Gly Glu Thr Gly Val Gly 
420 425 430 

aat gtc age ate acc acg ggg gga atg tgg gag gtc aat aag aat gtc 1344 
Asn Val Ser He Thr Thr Gly Gly Met Trp Glu Val Asn Lys Asn Val 
435 440 445 

tat acc acc att ggt gtt gcg ggc gtc gga aac etc aat att age gat 1392 
Tyr Thr Thr He Gly Val Ala Gly Val Gly Asn Leu Asn He Ser Asp 
450 455 460 

ggc ggt aag ttc gta teg caa aat att act ttt ttg ggc gat aaa gca 1440 
Gly Gly Lys Phe Val Ser Gin Asn He Thr Phe Leu Gly Asp Lys Ala 
465 470 475 480 

age ggt ate ggc aca ctg aac ctg atg gat gcg aca tea teg ttc gat 1488 
Ser Gly He Gly Thr Leu Asn Leu Met Asp Ala Thr Ser Ser Phe Asp 
485 490 495 

act gtg ggt ate aat gtc ggt aat ttt ggt age ggt ate gta aat gtc 1536 
Thr Val Gly He Asn Val Gly Asn Phe Gly Ser Gly He Val Asn Val 
500 505 510 

agt aat ggt gee acc ctt aat tea acg ggc tat gga ttt ate gga gga 1584 
Ser Asn Gly Ala Thr Leu Asn Ser Thr Gly Tyr Gly Phe He Gly Gly 
515 520 525 

aat gec tec ggt aag ggg ata gtt aat att tea acg gac agt etc tgg 1632 
Asn Ala Ser Gly Lys Gly He Val Asn He Ser Thr Asp Ser Leu Trp 
530 535 540 

aat tta aag aca tea tct act aac gca caa ttg eta cag gtc ggt gta 1680 
Asn Leu Lys Thr Ser Ser Thr Asn Ala Gin Leu Leu Gin Val Gly Val 
545 550 555 560 

tta ggc acg ggt gaa ctg aat att acc acc gga ggt ata gtt aaa gcg 1728 
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Leu Gly Thr Gly Glu lieu Asn lie Thr Thr Gly Gly lie Val Lys Ala 
565 570 575 

cgt gat aca cag ata get etc aat gac aaa agt aag ggc gac gtg agg 1776 
Arg Asp Thr Gin He Ala Leu Asn Asp Lys Ser Lys Gly Asp Val Arg 
580 585 590 

gtg gat ggg cag aac tct ctt ctt gaa aca ttc aat atg tac gta ggg 1824 
Val Asp Gly Gin Asn Ser lieu Leu Glu Thr Phe Asn Met Tyr Val Gly 
595 600 605 

aca tct ggt acg ggt acg tta acc ctg acg aat aac ggt acg ctg aat 1872 
Thr Ser Gly Thr Gly Thr Leu Thr Leu Thr Asn Asn Gly Thr Leu Asn 
610 615 620 

gtc gaa ggt gga gaa gtt tac tta ggt gtt ttt gag cct get gta gga 1920 
Val Glu Gly Gly Glu Val Tyr Leu Gly Val Phe Glu Pro Ala Val Gly 
625 630 635 640 

acg eta aac att ggt get get cac ggt gag gcg gcg gca gat gee gga 1968 
Thr Leu Asn lie Gly Ala Ala His Gly Glu Ala Ala Ala Asp Ala Gly 
645 650 655 

ttt ate acc' aat gcg acg aaa gtg gag ttt ggt ctt ggc gaa ggc gtt 2016 
Phe He Thr Asn Ala Thr Lys Val Glu Phe Gly Leu Gly Glu Gly Val 
660 665 670 

ttt gtc ttt aat cat acc aat aac agt gat gec ggc tac cag gtc gat 2064 
Phe Val Phe Asn His Thr Asn Asn Ser Asp Ala Gly Tyr Gin Val Asp 
675 680 685 

atg ctg att aca ggt gac gat aaa gac gga aaa gtg ate cat gat gca 2112 
Met Leu He Thr Gly Asp Asp Lys Asp Gly Lys Val He His Asp Ala 
690 695 700 

ggc cat acg gtg ttc aat gca ggg aat act tat age ggt aaa acg ctg 2160 
Gly His Thr Val Phe Asn Ala Gly Asn Thr Tyr Ser Gly Lys Thr Leu 
705 710 715 720 

gtc aat gac ggc etc ctg acc att gcg tct cat acg gca gat ggg gta 2208 
Val Asn Asp Gly Leu Leu Thr He Ala Ser His Thr Ala Asp Gly Val 
725 730 735 

acg ggc atg ggg teg agt gaa gta acc att gca aac ccc ggt acg etc 2256 
Thr Gly Met Gly Ser Ser Glu Val Thr He Ala Asn Pro Gly Thr Leu 
740 745 750 

gac att etc gca tea acg aac agt gca gga gat tac acg ctg acc aat 2304 
Asp He Leu Ala Ser Thr Asn Ser Ala Gly Asp Tyr Thr Leu Thr Asn 
755 760 765 

gcg etc aaa ggc gat ggc ttg atg cga gtg cag ctg tea tec tec gac 2352 
Ala Leu Lys Gly Asp Gly Leu Met Arg Val Gin Leu Ser Ser Ser Asp 
770 775 780 

aag atg ttt ggc ttt acc cat gca aca ggg act gaa ttc gee ggt gtt 2400 
Lys Met Phe Gly Phe Thr His Ala Thr Gly Thr Glu Phe Ala Gly Val 
785 790 795 800 

gee caa ctg aaa gac agt acc ttc act ctg gaa cgc gac aac acc get 2448 
Ala Gin Leu Lys Asp Ser Thr Phe Thr Leu Glu Arg Asp Asn Thr Ala 
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805 810 815 

gcg ctt act cac gcg atg ttg cag tct gac agt gaa aat acc aca teg 2496 
Ala Leu Thr His Ala Met Leu Gin Ser Asp Ser Glu Asn Thr Thr Ser 
820 825 830 

gta aaa gtt gga gag caa tec att ggt gga ctg gec atg aat gga ggt 2544 
Val Lys Val Gly Glu Gin Ser lie Gly Gly Leu Ala Met Asn Gly Gly 
835 840 845 

acc ate att ttc gat acg gat att cct get gcg acg etc gcg gag gga 2592 
Thr lie lie Phe Asp Thr Asp lie Pro Ala Ala Thr Leu Ala Glu Gly 
850 855 860 

tat ate age gtc gat acg ctg gtt gtc ggc gcg ggt gac tac acc tgg 2640 
Tyr lie Ser Val Asp Thr Leu Val Val Gly Ala Gly Asp Tyr Thr Trp 
865 870 875 880 

aaa ggc cgt aac tat cag gta aac ggg acg ggc gac gtg ctt ate gac 2688 
Lys Gly Arg Asn Tyr Gin Val Asn Gly Thr Gly Asp Val Leu lie Asp 
885 890 895 

gtg cct aaa ccg tgg aat gat ccc atg gcg aat aac cct ctg acg acg 2736 
Val Pro Lys Pro Trp Asn Asp Pro Met Ala Asn Asn Pro Leu Thr Thr 
900 905 910 

etc aat ttg ctg gaa cac gac gat age cat gtc ggc gtt caa ctg gtg 2784 
Leu Asn Leu Leu Glu His Asp Asp Ser His Val Gly Val Gin Leu Val 
915 920 925 

aag gcg caa acg gtt att ggg teg ggt ggc tea tta acg tta cgt gat 2832 
Lys Ala Gin Thr Val lie Gly Ser Gly Gly Ser Leu Thr Leu Arg Asp 
930 935 940 

tta cag ggc gac gag gtg gaa gcg gac aaa acg tta cac att gcg caa 2880 
Leu Gin Gly Asp Glu Val Glu Ala Asp Lys Thr Leu His lie Ala Gin 
945 950 955 960 

aac gga acg gtg gtc gee gag ggt gat tat gga ttc cgc etc acg acc 2928 
Asn Gly Thr Val Val Ala Glu Gly Asp Tyr Gly Phe Arg Leu Thr Thr 
965 970 975 

gca cca ggt aat ggt ttg tac gtt aac tat ggg ctg aaa gcg ctg aac 2976 
Ala Pro Gly Asn Gly Leu Tyr Val Asn Tyr Gly Leu Lys Ala Leu Asn 
980 985 990 

ate cat ggt ggg caa aag ctg acg tta gec gaa cat ggc gga gec tat 3024 
lie His Gly Gly Gin Lys Leu Thr Leu Ala Glu His Gly Gly Ala Tyr 
995 1000 1005 

ggc gca acg gee gat atg teg gca aaa ate ggt ggt gaa ggg gat ctg 3072 
Gly Ala Thr Ala Asp Met Ser Ala Lys lie Gly Gly Glu Gly Asp Leu 
1010 1015 1020 

gca ate aat acg gtg cga cag gtt teg ctt tec aac ggt cag aac gac 3120 
Ala lie Asn Thr Val Arg Gin Val Ser Leu Ser Asn Gly Gin Asn Asp 
1025 1030 1035 1040 

tat cag ggg gca acc tac gtt cag atg ggg aca tta cgt acc gat gcg 3168 
Tyr Gin Gly Ala Thr Tyr Val Gin Met Gly Thr Leu Arg Thr Asp Ala 
1045 1050 1055 
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gat ggt gca ctt ggc aac acc egg gaa ctg aac ate age aac gcg gec 3216 
Asp Gly Ala lieu Gly Asn Thr Arg Glu Leu Asn lie Ser Asn Ala Ala 
1060 1065 1070 

ate gtc gat ctt aat gga teg acg cag acg gta gag aca ttc acc ggg 3264 
lie Val Asp Leu Asn Gly Ser Thr Gin Thr Val Glu Thr Phe Thr Gly 
1075 1080 1085 

cag atg ggt teg act gtt ttg ttc aaa gag ggg gcg ctg acg gta aat 3312 
Gin Met Gly Ser Thr Val Leu Phe Lys Glu Gly Ala Leu Thr Val Asn 
1090 1095 1100 

aaa ggt ggg ate agt cag ggt gaa ctg aca ggt ggc gga aac ctg aat 3360 
Lys Gly Gly lie Ser Gin Gly Glu Leu Thr Gly Gly Gly Asn Leu Asn 
1105 1110 1115 1120 

gtt aca ggg gga acg ctg get ate gag ggg ctt aat gca cgc tac aat 3408 
Val Thr Gly Gly Thr Leu Ala lie Glu Gly Leu Asn Ala Arg Tyr Asn 
1125 . 1130 1135 

gcg tta acc age att age cca aat gcg gaa gtc age etc gat aat act 3456 
Ala Leu Thr Ser lie Ser Pro Asn Ala Glu Val Ser Leu Asp Asn Thr 
1140 1145 1150 

cag ggg tta ggc aga gga aat att gec aat gac ggt ctg tta acg eta 3504 
Gin Gly Leu Gly Arg Gly Asn lie Ala Asn Asp Gly Leu Leu Thr Leu 
1155 1160 1165 

aaa aac gtg act ggc gaa ctg cgt aat age ata age ggg aag ggt ate 3552 
Lys Asn Val Thr Gly Glu Leu Arg Asn Ser lie Ser Gly Lys Gly lie 
1170 1175 1180 

gtg age gca acc gee agg aca gat gta gag ttg gat ggc gat aat age 3600 
Val Ser Ala Thr Ala Arg Thr Asp Val Glu Leu Asp Gly Asp Asn Ser 
1185 1190 1195 1200 

cgc ttt gtg ggg caa ttc aac att gat aca ggc age gcg etc age gtc 3648 
Arg Phe Val Gly Gin Phe Asn lie Asp Thr Gly Ser Ala Leu Ser Val 
1205 1210 1215 

aac gag cag aaa aac ctg ggt gat get tec gtt ate aat aat ggc ctg 3696 
Asn Glu Gin Lys Asn Leu Gly Asp Ala Ser Val He Asn Asn Gly Leu 
1220 1225 k 1230 

etc acc ate tec act gag cgt age tgg gcg atg acg cac agt ate age 3744 
Leu Thr He Ser Thr Glu Arg Ser Trp Ala Met Thr His Ser He Ser 
1235 1240 1245 

ggt age ggt gat gtg aca aaa ctg ggt acc ggg ate ctg act ctt aac 3792 
Gly Ser Gly Asp Val Thr Lys Leu Gly Thr Gly He Leu Thr Leu Asn 
1250 1255 1260 

aac gat tec gcg gcg tat cag ggt acg acg gat ate gtg ggg ggg gaa 3840 
Asn Asp Ser Ala Ala Tyr Gin Gly Thr Thr Asp He Val Gly Gly Glu 
1265 1270 1275 1280 

att get ttc ggt tec gac tct gee att aat atg gca agt caa cac att 3888 
He Ala Phe Gly Ser Asp Ser Ala He Asn Met Ala Ser Gin His He 
1285 1290 1295 
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aat ate cat aac age ggt gtg atg teg gga aat gtc acc act gca ggt 3936 
Asn lie His Asn Ser Gly Val Met Ser Gly Asn Val Thr Thr Ala Gly 
1300 1305 1310 

gat atg aac gtt atg cct ggg ggg ggc act gcg tgt cgc taa 3978 
Asp Met Asn Val Met Pro Gly Gly Gly Thr Ala Cys Arg * 
1315 1320 1325 



<210> 112 
<211> 756 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . (756) 

<400> 112 

atg act gaa gca caa aga cat caa ate etc ctg gaa atg etc gca caa 48 
Met Thr Glu Ala Gin Arg His Gin lie Leu Leu Glu Met Leu Ala Gin 
15 10 15 

ttg ggc ttt gtg acc gtt gag aaa gtc gtt gag cgt ctg gga att teg 96 
Leu Gly Phe Val Thr Val Glu Lys Val Val Glu Arg Leu Gly He Ser 
20 25 30 

cct gee act gcg cga cgc gat ate aat aaa ctt gac gaa age ggc aaa 144 
Pro Ala Thr Ala Arg Arg Asp He Asn Lys Leu Asp Glu Ser Gly Lys 
35 40 45 

ctg aaa aaa gtg cgc aat ggc gca gaa get att acc caa cag cgc ccg 192 
Leu Lys Lys Val Arg Asn Gly Ala Glu Ala He Thr Gin Gin Arg Pro 
50 55 60 

cgc tgg acg ccg atg aat ctg cat cag gcg cag aat cac gat gaa aaa 240 
Arg Trp Thr Pro Met Asn Leu His Gin Ala Gin Asn His Asp Glu Lys 
65 70 75 80 

gta cgt ate get aaa gcg gec teg cag ctg gtt aat ccg ggc gaa age 288 
Val Arg He Ala Lys Ala Ala Ser Gin Leu Val Asn Pro Gly Glu Ser 
85 90 95 

gta gtc ate aac tgc ggc tec acc gcg ttt ctg ctt ggg egg gaa atg 336 
Val Val He Asn Cys Gly Ser Thr Ala Phe Leu Leu Gly Arg Glu Met 
100 105 110 

tgt ggc aag cca gtg caa ate ate act aat tat eta ccg ctg gca aat 384 
Cys Gly Lys Pro Val Gin He He Thr Asn Tyr Leu Pro Leu Ala Asn 
115 120 125 

tac ctg ate gat caa gaa cat gac age gtg ate att atg ggc gga cag 432 
Tyr Leu He Asp Gin Glu His Asp Ser Val He He Met Gly Gly Gin 
130 135 140 

tac aac aaa agt cag tec ate act tta age ccg cag ggc age gaa aac 480 
Tyr Asn Lys Ser Gin Ser He Thr Leu Ser Pro Gin Gly Ser Glu Asn 
145 150 155 160 

agt etc tat gee ggg cac tgg atg ttt acc age gga aaa ggg ctg acc 528 
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Ser Leu Tyr Ala Gly His Trp Met Phe Thr Ser Gly Lys Gly Leu Thr 
165 170 175 

gca gaa ggg ttg tat aaa acc gat atg ctg aca gca atg gca gag cag 576 
Ala Glu Gly Leu Tyr Lys Thr Asp Met Leu Thr Ala Met Ala Glu Gin 
180 185 190 

aag atg ctg age gtg gta ggg aaa ctg gtg gta ctg gtt gat age agt 624 
Lys Met Leu Ser Val Val Gly Lys Leu Val Val Leu Val Asp Ser Ser 
195 200 205 

aag att ggc gaa cgc gcg gga atg ctt ttt age cgt gee gat caa ate 672 
Lys He Gly Glu Arg Ala Gly Met Leu Phe Ser Arg Ala Asp Gin He 
210 215 220 

gat atg ctt ate acc ggc aaa aat get aac ccg gaa ate ctg caa caa 720 
Asp Met Leu He Thr Gly Lys Asn Ala Asn Pro Glu He Leu Gin Gin 
225 230 235 240 

ctg gaa gcg caa ggg gtc age att ctg cgt gtt taa 756 
Leu Glu Ala Gin Gly Val Ser He Leu Arg Val * 
245 250 



<210> 113 
<211> 825 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (825) 

<400> 113 

atg acc gaa ttt aca act ctt ctt cag caa gga aac gec tgg ttc ttc 48 
Met Thr Glu Phe Thr Thr Leu Leu Gin Gin Gly Asn Ala Trp Phe Phe 
15 10 15 

ate ccc age gee ate tta ctt ggt gcg ctt cat ggc ctg gaa cca ggg 96 
He Pro Ser Ala He Leu Leu Gly Ala Leu His Gly Leu Glu Pro Gly 
20 25 30 

cac tea aaa acg atg atg gcg gcg ttt ate ate gee ate aaa ggc acc 144 
His Ser Lys Thr Met Met Ala Ala Phe He He Ala He Lys Gly Thr 
35 40 45 

att aaa caa gcg gtg atg etc gga ctg gca gca act att teg cat acc 192 
He Lys Gin Ala Val Met Leu Gly Leu Ala Ala Thr He Ser His Thr 
50 55 60 

gca gtg gtc tgg tta att gee ttt ggc ggg atg gtg ate age aag cgc 240 
Ala Val Val Trp Leu He Ala Phe Gly Gly Met Val He Ser Lys Arg 
65 70 75 B0 

ttt act get caa tea gca gaa ccg tgg etc cag ctg att tec gca gtg 288 
Phe Thr Ala Gin Ser Ala Glu Pro Trp Leu Gin Leu He Ser Ala Val 
85 90 95 

ate att att age acc gcg ttc tgg atg ttc tgg cgt acc tgg cgc ggc 336 
He He He Ser Thr Ala Phe Trp Met Phe Trp Arg Thr Trp Arg Gly 
100 105 110 
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gaa cgc aac tgg ctg gag aat atg cac ggg cat gat tat gag cat cat 384 

Glu Arg Asn Trp Leu Glu Asn Met His Gly His Asp Tyr Glu His His 
115 120 " 125 

cat cac gat cac gaa cat cac cac gac cat gga cat cat cac cat cac 432 

His His Asp His Glu His His His Asp His Gly His His His His His 
130 135 140 

gaa cat ggc gag tat cag gat gcc cat gca cga gcc cat gcc aat gac 480 

Glu His Gly Glu Tyr Gin Asp Ala His Ala Arg Ala His Ala Asn Asp 

145 150 155 160 

att aaa cga cgc ttt gat ggt aga gag gtc acc aac tgg caa att ttg 528 

lie Lys Arg Arg Phe Asp Gly Arg Glu Val Thr Asn Trp Gin lie Leu 
165 170 175 

tta ttt ggc tta acc ggt ggc ctt ate ccc tgc ccg gca gca att acc 576 

Leu Phe Gly Leu Thr Gly Gly Leu He Pro Cys Pro Ala Ala He Thr 
180 185 190 

gtg ctg ttg ata tgc att cag ttg aaa gcc ctg aca ctg ggc gca aca 624 

Val Leu Leu He Cys He Gin Leu Lys Ala Leu Thr Leu Gly Ala Thr 
195 200 205 

ctg gtc gtc agt ttc age att ggc ctg gcg tta acg ctt gtc acc gta 672 

Leu Val Val Ser Phe Ser He Gly Leu Ala Leu Thr Leu Val Thr Val 
210 215 220 

ggc gtt ggc gca gca ate age gtt cag cag gtc gca aaa cgc tgg age 720 

Gly Val Gly Ala Ala He Ser Val Gin Gin Val Ala Lys Arg Trp Ser 

225 230 235 240 

gga ttt aac act etc get aaa cgc gcc ccc tat ttt tec agt ctg ttg 768 

Gly Phe Asn Thr Leu Ala Lys Arg Ala Pro Tyr Phe Ser Ser Leu Leu 
245 250 255 

att ggc tta gtc ggt gtg tat atg ggc gta cat ggc ttc atg ggc ata 816 

He Gly Leu Val Gly Val Tyr Met Gly Val His Gly Phe Met Gly He 
260 265 270 

atg cga taa 825 
Met Arg * 



<210> 114 
<211> 519 
<212> DNA 

<213> Escherichia coli ) 

<220> 

<221> CDS 

<222> (1)...(519) 

<400> 114 

atg att ctt aaa tea get att tec get gac tct ctg etc get aag gat 48 
Met He Leu Lys Ser Ala He Ser Ala Asp Ser Leu Leu Ala Lys Asp 
1 5 10 15 

get ttt agg gca tec ttt cat tta cac ttt tta cga aat cat ggg ate 96 
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Ala Phe Arg Ala Ser Phe His Leu His Phe Leu Arg Asn His Gly lie 
20 25 30 

act aac aaa ata teg ctt gtc agt tat att gta tgg cag gaa aga tat 144 
Thr Asn Lys lie Ser Leu Val Ser Tyr lie Val Trp Gin Glu Arg Tyr 
35 40 45 

gcg act gat att aca gat ccc caa agt gga gag ttt atg acc att aaa 192 
Ala Thr Asp lie Thr Asp Pro Gin Ser Gly Glu Phe Met Thr lie Lys 
50 55 60 

aat aag atg ttg ctg ggt gcg ctt ttg ctg gtt acc agt gec gec tgg 240 
Asn Lys Met Leu Leu Gly Ala Leu Leu Leu Val Thr Ser Ala Ala Trp 
65 70 75 80 

gec gca cca gec acc gcg ggt teg acc aat acc teg gga att tct aag 288 
Ala Ala Pro Ala Thr Ala Gly Ser Thr Asn Thr Ser Gly lie Ser Lys 
85 90 95 

tat gag tta agt agt ttc att get gac ttt aag cat ttc aaa cca ggg 336 
Tyr Glu Leu Ser Ser Phe lie Ala Asp Phe Lys His Phe Lys Pro Gly 
100 105 110 

gac acc gta cca gaa atg tac cgt acc gat gag tac aac att aag cag 384 
Asp Thr Val Pro Glu Met Tyr Arg Thr Asp Glu Tyr Asn He Lys Gin 
115 120 125 

tgg cag ttg cgt aac ctg ccc gcg cct gat gec ggg acg cac tgg acc 432 
Trp Gin Leu Arg Asn Leu Pro Ala Pro Asp Ala Gly Thr His Trp Thr 
130 135 140 

tat atg ggt ggc gcg tac gtg ttg ate age gac acc gac ggt aaa ate 480 
Tyr Met Gly Gly Ala Tyr Val Leu He Ser Asp Thr Asp Gly Lys He 
145 150 155 160 

att aaa gec tac gac ggt gag att ttt tat cat cgc taa 519 
He Lys Ala Tyr Asp Gly Glu He Phe Tyr His Arg * 
165 170 



<210> 115 
<211> 1035 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (1) . . . (1035) 
<400> 115 

atg gaa att cgc ata atg eta ttt ata tta atg atg atg gtt atg cct 48 
Met Glu He Arg He Met Leu Phe He Leu Met Met Met Val Met Pro 
15 10 15 

gtg age tat gcg gca tgt tat agt gag tta tct gtt cag cac aac ttg 96 
Val Ser Tyr Ala Ala Cys Tyr Ser Glu Leu Ser Val Gin His Asn Leu 
. 20 25 30 

gtt gtt cag ggg gat ttt gca ctt act caa aca caa atg gcg aca tat 144 
Val Val Gin Gly Asp Phe Ala Leu Thr Gin Thr Gin Met Ala Thr Tyr 
35 40 45 
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gag cat aat ttt aat gat teg tea tgc gta agt aca aat act ate acc 192 
Glu His Asn Phe Asn Asp Ser Ser Cys Val Ser Thr Asn Thr lie Thr 
50 55 * 60 

cct atg age ccg teg gat att att gtt gga ctt tat aac gat acc ata 240 
Pro Met Ser Pro Ser Asp lie lie Val Gly Leu Tyr Asn Asp Thr He 
65 70 75 80 

aaa tta aat tta cat ttt gaa tgg acc aat aaa aac aac ate acg ttg 288 
Lys Leu Asn Leu His Phe Glu Trp Thr Asn Lys Asn Asn He Thr Leu 
85 90 95 

tea aat aat cag acc agt ttc acc agt ggt tat tea gtt acg gtg aca 336 
Ser Asn Asn Gin Thr Ser Phe Thr Ser Gly Tyr Ser Val Thr Val Thr 
100 105 110 

cct gcg gee agt aat gca aaa gtg aat gtt tct gcg ggg ggc ggc ggt 384 
Pro Ala Ala Ser Asn Ala Lys Val Asn Val Ser Ala Gly Gly Gly Gly 
115 120 125 

tea gtg atg att aat ggt gtt gcg aca tta tec agt get tea tea teg 432 
Ser Val Met He Asn Gly Val Ala Thr Leu Ser Ser Ala Ser Ser Ser 
130 135 140 

aca cgc ggg agt gee gca gta caa ttt eta ctg tgt tta tta ggt ggc 480 
Thr Arg Gly Ser Ala Ala Val Gin Phe Leu Leu Cys Leu Leu Gly Gly 
145 150 155 160 

aag tea tgg gat gca tgt gta aat age tac aga aat gca ttg gca caa 528 
Lys Ser Trp Asp Ala Cys Val Asn Ser Tyr Arg Asn Ala Leu Ala Gin 
165 170 ~* 175 

aat gca ggt gtc tat tec ttt aat ctg aca ttg tea tac aac ccg ata 576 
Asn Ala Gly Val Tyr Ser Phe Asn Leu Thr Leu Ser Tyr Asn Pro He 
180 185 190 

acc aca acc tgc aaa ccg gac gat tta tta att act tta gac agt att 624 
Thr Thr Thr Cys Lys Pro Asp Asp Leu Leu He Thr Leu Asp Ser He 
195 200 205 

ccc gtt tea caa tta cca gec aca ggt aac aaa gca aca ata aat agt 672 
Pro Val Ser Gin Leu Pro Ala Thr Gly Asn Lys Ala Thr He Asn Ser 
210 215 220 

aaa caa ggg gat att att ctg cgt tgt aaa aat tta tta ggt caa caa 720 
Lys Gin Gly Asp He He Leu Arg Cys Lys Asn Leu Leu Gly Gin Gin 
225 230 235 240 

aat caa aca tea egg aaa atg cag gtg tat tta tea agt tct gac ttg 768 
Asn Gin Thr Ser Arg Lys Met Gin Val Tyr Leu Ser Ser Ser Asp Leu 
245 250 255 

tta acc aac age aac aca ata ctg aaa ggt gcg gaa gat aat ggc gta 816 
Leu Thr Asn Ser Asn Thr He Leu Lys Gly Ala Glu Asp Asn Gly Val 
260 265 ~ 270 

gga ttt att ctt gaa agt aat ggt teg cca gtc aca ctt tta aat ate 864 
Gly Phe He Leu Glu Ser Asn Gly Ser Pro Val Thr Leu Leu Asn He 
275 280 285 
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act aac age agt aaa gga tat aca aat tta aag gaa gtt gcg gcg aag 912 
Thr Asn Ser Ser Lys Gly Tyr Thr Asn Leu Lys Glu Val Ala Ala Lys 
290 295 300 

tea aaa ctt aca gat aca acg gtt tea att ccg ata aca gee agt tac 960 
Ser Lys Leu Thr Asp Thr Thr Val Ser lie Pro lie Thr Ala Ser Tyr 
305 310 315 320 

tac gtc tac gat aca aac aaa gtt aaa tct ggc gca ctg gag gca acc 1008 
Tyr Val Tyr Asp Thr Asn Lys Val Lys Ser Gly Ala Leu Glu Ala Tor 
325 330 335 

gca tta ate aac gtg aaa tac gac taa 1035 
Ala Leu lie Asn Val Lys Tyr Asp * 
340 



<210> 116 
<211> 2481 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (2481) 

<400> 116 

atg ttg aga atg acc cca ctt gca tea gca ate gta gcg tta ttg etc 48 
Met Leu Arg Met Thr Pro Leu Ala Ser Ala lie Val Ala Leu Leu Leu 
15 10 15 

ggc att gaa get tat gca get gaa gaa acc ttt gat acc cat ttt atg 96 
Gly lie Glu Ala Tyr Ala Ala Glu Glu Thr Phe Asp Thr His Phe Met 
20 25 30 

ata ggt gga atg aaa gac cag cag gtt gca aat att cgt ctt gat gat 144 
lie Gly Gly Met Lys Asp Gin Gin Val Ala Asn lie Arg Leu Asp Asp 
35 40 45 

aat caa ccc tta ccg ggg cag tat gac ate gat att tat gtc aat aag 192 
Asn Gin Pro Leu Pro Gly Gin Tyr Asp lie Asp lie Tyr Val Asn Lys 
50 55 60 

caa tgg cgc ggg aaa tat gag att att gtt aaa gac aac ccg caa gaa 240 
Gin Trp Arg Gly Lys Tyr Glu He He Val Lys Asp Asn Pro Gin Glu 
65 70 75 80 

aca tgt tta tea aga gaa gtt ate aag egg tta ggc att aat age gat 288 
Thr Cys Leu Ser Arg Glu Val He Lys Arg Leu Gly He Asn Ser Asp 
85 90 95 

aac ttc gee age ggt aag caa tgt tta aca ttt gag caa ctt gtt cag 336 
Asn Phe Ala Ser Gly Lys Gin Cys Leu Thr Phe Glu Gin Leu Val Gin 
100 105 110 

ggt ggg age tat acc tgg gat ate ggg gtt ttt cgt etc gat ttc agt 384 
Gly Gly Ser Tyr Thr Trp Asp He Gly Val Phe Arg Leu Asp Phe Ser 
115 120 125 

gtc ccg cag gee tgg gtg gaa gaa ctg gaa agt ggc tat gtt cca ccg 432 
Val Pro Gin Ala Trp Val Glu Glu Leu Glu Ser Gly Tyr Val Pro Pro 
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130 135 140 

gaa aac tgg gag egg ggt att aat gcg ttt tat acc tct tat tat ctg 480 
Glu Asn Trp Glu Arg Gly lie Asn Ala Phe Tyr Thr Ser Tyr Tyr Leu 
145 150 155 160 

agt cag tat tac age gac tat aaa gcg teg ggt aat aac aag agt aca 528 
Ser Gin Tyr Tyr Ser Asp Tyr Lys Ala Ser Gly Asn Asn Lys Ser Thr 
165 170 175 

tat gta cgt ttt aac age ggg tta aat tta. ctg ggg tgg caa ctg cat 576 
Tyr Val Arg Phe Asn Ser Gly Leu Asn Leu Leu Gly Trp Gin Leu His 
180 185 190 

tct gat gec agt ttc agt aaa aca aat aac aat cca ggg gtg tgg aaa 624 
Ser Asp Ala Ser Phe Ser Lys Thr Asn Asn Asn Pro Gly Val Trp Lys 
195 200 205 

age aat acc ctg tat ctg gaa cgt gga ttt gee caa ctt etc ggc acg 672 
Ser Asn Thr Leu Tyr Leu Glu Arg Gly Phe Ala Gin Leu Leu Gly Thr 
210 215 220 

ctt cgc gtg ggt gat atg tac aca tea age gat att ttt gat tct gtt 720 
Leu Arg Val Gly Asp Met Tyr Thr Ser Ser Asp lie Phe Asp Ser Val 
225 230 235 240 

cgc ttc aga ggt gtg egg ttg ttt cgt gat atg cag atg ttg cct aac 768 
Arg Phe Arg Gly Val Arg Leu Phe Arg Asp Met Gin Met Leu Pro Asn 
245 250 255 

teg aaa caa aat ttt acg cca egg gtg cag ggg att get cag agt aac 816 
Ser Lys Gin Asn Phe Thr Pro Arg Val Gin Gly lie Ala Gin Ser Asn 
260 265 270 

gcg ctg gta act att gaa cag aat ggt ttt gtg gtt tat cag aaa gag 864 
Ala Leu Val Thr lie Glu Gin Asn Gly Phe Val Val Tyr Gin Lys Glu 
275 280 285 

gtt cct cct ggc ccg ttc gcg att aca gat ttg cag ttg gee ggt ggt 912 
Val Pro Pro Gly Pro Phe Ala He Thr Asp Leu Gin Leu Ala Gly Gly 
290 295 300 

gga gca gat ctt gat gtc age gtg aaa gag gcg gac ggc teg gta acc 960 
Gly Ala Asp Leu Asp Val Ser Val Lys Glu Ala Asp Gly Ser Val Thr 
305 310 315 320 

acc tat ctg gtg cct tat gca gcg gtg cca aat atg ctg caa ccc ggc 1008 
Thr Tyr Leu Val Pro Tyr Ala Ala Val Pro Asn Met Leu Gin Pro Gly 
325 330 335 

gtg teg aaa tat gat tta gcg gcg ggt cgt age cat att gaa ggg gcg 1056 
Val Ser Lys Tyr Asp Leu Ala Ala Gly Arg Ser His He Glu Gly Ala 
340 345 350 

age aaa caa agt gat ttt gtc cag gcg ggt tat cag tat ggt ttt aat 1104 
Ser Lys Gin Ser Asp Phe Val Gin Ala Gly Tyr Gin Tyr Gly Phe Asn 
355 360 365 

aat tta ttg acg ctg tat ggt ggc teg atg gtc gcg aat aat tat tac 1152 
Asn Leu Leu Thr Leu Tyr Gly Gly Ser Met Val Ala Asn Asn Tyr Tyr 
370 375 380 
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gcg ttt act ttg ggg get ggc tgg aat aca cgc att ggt gec att tec 1200 
Ala Plie Thr Leu Gly Ala Gly Trp Asn Thr Arg He Gly Ala He Ser 
385 390 395 400 

gtc gat gec act aag teg cat agt aaa caa gac aac ggc gat gtg ttt 1248 
Val Asp Ala Thr Lys Ser His Ser Lys Gin Asp Asn Gly Asp Val Phe 
405 410 415 

gac ggg caa agt tat caa att gee tac aac aaa ttt gtg age caa acg 1296 
Asp Gly Gin Ser Tyr Gin He Ala Tyr Asn Lys Phe Val Ser Gin Thr 
420 425 430 

teg acg cgt ttt ggt ctg gcg gee tgg cgt tat teg teg cgt gat tac 1344 
Ser Thr Arg Phe Gly Leu Ala Ala Trp Arg Tyr Ser Ser Arg Asp Tyr 
435 440 445 

egg aca ttt aac gat cac gtt tgg gca aac aat aaa gat aat tat cgc 1392 
Arg Thr Phe Asn Asp His Val Trp Ala Asn Asn Lys Asp Asn Tyr Arg 
450 455 460 

cgt gat gaa aac gat gtc tat gac att gee gat tat tac cag aac gat 1440 
Arg Asp Glu Asn Asp Val Tyr Asp He Ala Asp Tyr Tyr Gin Asn Asp 
465 470 475 480 

ttt ggc cgc aaa aat age ttt tec gec aat atg age cag tea ttg cca 1488 
Phe Gly Arg Lys Asn Ser Phe Ser Ala Asn Met Ser Gin Ser Leu Pro 
485 490 495 

gaa ggt tgg ggg tct gtg tea tta agt acg tta tgg cga gat tac tgg 1536 
Glu Gly Trp Gly Ser Val Ser Leu Ser Thr Leu Trp Arg Asp Tyr Trp 
500 505 510 

ggg cgt age ggc agt agt aag gat tat cag ttg agt tat tec aac aac 1584 
Gly Arg Ser Gly Ser Ser Lys Asp Tyr Gin Leu Ser Tyr Ser Asn Asn 
515 520 525 

ctg cga egg ata age tat acc etc gcg gca age cag get tat gac gag 1632 
Leu Arg Arg He Ser Tyr Thr Leu Ala Ala Ser Gin Ala Tyr Asp Glu 
530 535 540 

aat cat cat gaa gag aaa cgt ttt aat att ttt ata teg att ccc ttt 1680 
Asn His His Glu Glu Lys Arg Phe Asn He Phe He Ser He Pro Phe 
545 550 555 560 

gat tgg ggt gat gac gtt teg acg cct cgt egg caa ata tat atg tct 1728 
Asp Trp Gly Asp Asp Val Ser Thr Pro Arg Arg Gin He Tyr Met Ser 
565 570 575 

aac tea acg acg ttt gat gat cag ggg ttt gee tea aat aat acg gga 1776 
Asn Ser Thr Thr Phe Asp Asp Gin Gly Phe Ala Ser Asn Asn Thr Gly 
580 585 590 

tta tea gga aca gta ggg agt egg gat cag ttc aat tat ggt gtc aac 1824 
Leu Ser Gly Thr Val Gly Ser Arg Asp Gin Phe Asn Tyr Gly Val Asn 
595 600 605 

ctg agt cat caa cat cag gga aat gaa acg aca get ggg gcg aat ttg 1872 
Leu Ser His Gin His Gin Gly Asn Glu Thr Thr Ala Gly Ala Asn Leu 
610 615 620 
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acc tgg aac gcg ccg gtt gcg aca gtg aat ggc agt tat agt cag teg 1920 
Thr Trp Asn Ala Pro Val Ala Thr Val Asn Gly Ser Tyr Ser Gin Ser 
625 630 635 640 

agt act tat cga cag get gga gec agt gtt tea ggg ggc att gtc gec 1968 
Ser Thr Tyr Arg Gin Ala Gly Ala Ser Val Ser Gly Gly lie Val Ala 
645 650 655 

tgg teg ggt ggc gtt aat ctg gcg aac cgt ctt tec gaa acg ttt get 2016 
Trp Ser Gly Gly Val Asn Leu Ala Asn Arg Leu Ser Glu Thr Phe Ala 
660 665 670 

gtg atg aat gcg cca gga att aaa gat get tat gtc aat ggg caa aaa 2064 
Val Met Asn Ala Pro Gly lie Lys Asp Ala Tyr Val Asn Gly Gin Lys 
675 680 685 

tat cgc aca aca aac cgt aat gga gtg gtg ata tac gac gga atg aca 2112 
Tyr Arg Thr Thr Asn Arg Asn Gly Val Val lie Tyr Asp Gly Met Thr 
690 695 700 

cct tat egg gaa aat cac ctg atg ctg gat gtg teg caa age gat age 2160 
Pro Tyr Arg Glu Asn His Leu Met Leu Asp Val Ser Gin Ser Asp Ser 
705 710 " 715 ~ 720 

gaa gca gaa tta cgt ggc aac egg aaa att gee gec cct tat cgc ggc 2208 
Glu Ala Glu Leu Arg Gly Asn Arg Lys lie Ala Ala Pro Tyr Arg Gly 
725 730 735 

gcg gtt gta ctg gtt aat ttt gat acc gat cag cgc aag cca tgg ttt 2256 
Ala Val Val Leu Val Asn Phe Asp Thr Asp Gin Arg Lys Pro Trp Phe 
740 745 750 

ata aaa gcg tta aga gca gat ggg caa tea tta acg ttt ggt tat gaa 2304 
lie Lys Ala Leu Arg Ala Asp Gly Gin Ser Leu Thr Phe Gly Tyr Glu 
755 760 765 

gtc aat gat ate cat ggt cat aat att ggc gtt gtc ggc cag gga agt 2352 
Val Asn Asp He His Gly His Asn He Gly Val Val Gly Gin Gly Ser 
770 775 780 

cag tta ttt att cgc acc aat gaa gta ccg cca teg gtt aat gtg gca 2400 
Gin Leu Phe He Arg Thr Asn Glu Val Pro Pro Ser Val Asn Val Ala 
785 790 795 800 

att gat aag caa caa gga ctt tea tgc aca ate acc ttc ggt aaa gag 2448 
He Asp Lys Gin Gin Gly Leu Ser Cys Thr He Thr Phe Gly Lys Glu 
805 810 815 

att gat gaa agt aga aat tat att tgc cag taa 2481 
He Asp Glu Ser Arg Asn Tyr He Cys Gin * 
820 825 



<210> 117 
<211> 720 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (720) 
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<400> 117 

atg gcc get ate cca tgg egg cct ttt aat tta aga ggc att aaa atg 48 

Met Ala Ala lie Pro Trp Arg Pro Phe Asn Leu Arg Gly He Lys Met 
15 10 15 

aaa gga tta tta tct tta etc att ttt tct atg gtc ctt cct gca cat 96 
Lys Gly Leu Leu Ser Leu Leu He Plie Ser Met Val Leu Pro Ala His 
20 25 30 

gcc gga att gtt ate tac ggg acg cgc att att tac ccg gca gaa aat 144 
Ala Gly He Val He Tyr Gly Thr Arg He He Tyr Pro Ala Glu Asn 
35 40 45 

aaa gaa gtg atg gtg cag ttg atg aac cag gga aac cgt tct teg ctg 192 
Lys Glu Val Met Val Gin Leu Met Asn Gin Gly Asn Arg Ser Ser Leu 
50 55 60 

ctg cag gcg tgg att gat gat ggc gat acg tea tta cca cca gaa aaa 240 
Leu Gin Ala Trp He Asp Asp Gly Asp Thr Ser Leu Pro Pro Glu Lys 
65 70 75 80 

att cag gtt cct ttc atg tta acg cca cca gtg gca aaa ata ggg gca 288 
He Gin Val Pro Phe Met Leu Thr Pro Pro Val Ala Lys He Gly Ala 
85 90 95 

aat tec ggg cag caa gta aaa ate aaa att atg ccg aat aaa ctg ccc 336 
Asn Ser Gly Gin Gin Val Lys He Lys He Met Pro Asn Lys Leu Pro 
100 105 110 

act aat aaa gaa age att ttt tat ctg aat gtt ctg gat att cca cca 384 
Thr Asn Lys Glu Ser He Phe Tyr Leu Asn Val Leu Asp He Pro Pro 
115 120 125 

aat agt cca gag caa gaa ggt aag aat gca ctg aag ttt gcg atg caa 432 
Asn Ser Pro Glu Gin Glu Gly Lys Asn Ala Leu Lys Phe Ala Met Gin 
130 135 140 

\ 

aac aga att aag ttg ttt tac egg cca gcg ggt att get ccg gta aat 480 
Asn Arg He Lys Leu Phe Tyr Arg Pro Ala Gly lie Ala Pro Val Asn 
145 150 155 160 

aaa gcg aca ttt aaa aaa ttg ctg gta aat cgc agt ggc aat ggt ttg 528 
Lys Ala Thr Phe Lys Lys Leu Leu Val Asn Arg Ser Gly Asn Gly Leu 
165 170 175 

gtg ata aaa aat gac tea get aat tgg gtg acg att teg gat gtc aaa 576 
Val He Lys Asn Asp Ser Ala Asn Trp Val Thr He Ser Asp Val Lys 
180 185 190 

get aat aat gtc aaa gtc aat tat gaa act att atg att gcc ccc tta 624 
Ala Asn Asn Val Lys Val Asn Tyr Glu Thr He Met He Ala Pro Leu 
195 200 205 

gaa agt cag agt gtt aat gtc aaa agt aat aat gca aat aac tgg cat 672 
Glu Ser Gin Ser Val Asn Val Lys Ser Asn Asn Ala Asn Asn Trp His 
210 215 220 

ctg acc att ate gat gac cat ggc aac tat att agt gac aaa att taa 720 
Leu Thr He He Asp Asp His Gly Asn Tyr He Ser Asp Lys He * 
225 230 235 



-53- 



WO 01/48209 



PCTYUSOO/34419 



<210> 118 
<211> 543 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (543) 

<400> 118 

atg aaa cgt tea att att get gec get gtc ttt tct tct ttt ttt atg 48 . 
Met Lys Arg Ser lie lie Ala Ala Ala Val Phe Ser Ser Phe Phe Met 
15 10 15 

age get gga gta ttt get gca gae gtt gat ace gga aca tta act att 96 
Ser Ala Gly Val Phe Ala Ala Asp Val Asp Thr Gly Thr Leu Thr He 
20 25 30 

aag ggg aat att gca gaa tct ccg tgt aaa ttc gaa gcg ggt ggt gat 144 
Lys Gly Asn He Ala Glu Ser Pro Cys Lys Phe Glu Ala Gly Gly Asp 
35 40 45 

tea gta agt att aat atg ccg act gta cca acc agt gtc ttt gaa ggt 192 
Ser Val Ser He Asn Met Pro Thr Val Pro Thr Ser Val Phe Glu Gly 
50 55 60 

aaa get aaa tat tct acc tat gat gat gca gtc ggt gta acc age age 240 
Lys Ala Lys Tyr Ser Thr Tyr Asp Asp Ala Val Gly Val Thr Ser Ser 
65 70 75 80 

atg tta aaa att age tgc ccg aaa gaa gtt get ggt gta aaa etc teg 288 
Met Leu Lys He Ser Cys Pro Lys Glu Val Ala Gly Val Lys Leu Ser 
85 90 95 

ttg att acc aac gat aaa ata acc ggt aac gat aag gcg ata gec agt 336 
Leu He Thr Asn Asp Lys He Thr Gly Asn Asp Lys Ala He Ala Ser 
100 105 110 

age aac gat acc gtg ggt tac tat etc tat tta ggt gat aac age gat 384 
Ser Asn Asp Thr Val Gly Tyr Tyr Leu Tyr Leu Gly Asp Asn Ser Asp 
115 120 125 

gtc ctg gat gtt tct gca cct ttt aac att gag agt tat aaa aca gcg 432 
Val Leu Asp Val Ser Ala Pro Phe Asn He Glu Ser Tyr Lys Thr Ala 
130 135 140 

gaa ggt caa tat get att ccg ttt aaa gca aaa tac ctg aaa ctg aca 480 
Glu Gly Gin Tyr Ala He Pro Phe Lys Ala Lys Tyr Leu Lys Leu Thr 
145 150 155 160 

gat aac tea gtg caa tea ggt gat gtg tta tct tct ctg gtt atg cgt 528 
Asp Asn Ser Val Gin Ser Gly Asp Val Leu Ser Ser Leu Val Met Arg 
165 170 175 

gtg gcg cag gat taa 543 
Val Ala Gin Asp * 
180 
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<210> 119 
<211> 288 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . (288) 

<400> 119 

atg ctg cac aca tta cat cgc tea ccc tgg ctg acg gat ttt get gcg 48 
Met Leu His Thr Leu His Arg Ser Pro Trp Leu Thr Asp Phe Ala Ala 
1 5 10 15 

ctg ctg cgt ctg etc agt gaa gga gac gaa ctg eta tta ttg caa gat 96 
Leu Leu Arg Leu Leu Ser Glu Gly Asp Glu Leu Leu Leu Leu Gin Asp 
20 25 30 

ggc gta act gec gca gtt gac ggt aac cgc tac ctt gaa agt ctg cgt 144 
Gly Val Thr Ala Ala Val Asp Gly Asn Arg Tyr Leu Glu Ser Leu Arg 
35 40 45 

aat gec ccc att aag gtc tat gee ctg aac gaa gac ctt att gee cgc 192 
Asn Ala Pro lie Lys Val Tyr Ala Leu Asn Glu Asp Leu lie Ala Arg 
50 55 60 

ggt ttg act ggt caa att teg aac gac ate att etc att gac tat act 240 
Gly Leu Thr Gly Gin He Ser Asn Asp He He Leu He Asp Tyr Thr 
65 70 75 80 

gat ttc gtc aga ctt acg gtt aag cac ccc age cag atg gee tgg tga 288 
Asp Phe Val Arg Leu Thr Val Lys His Pro Ser Gin Met Ala Trp * 
85 90 95 



<210> 120 
<211> 360 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (360) 

<400> 120 

atg aaa cga att gcg ttt gtt ttt tct act gca cct cat ggt aca gec 48 
Met Lys Arg He Ala Phe Val Phe Ser Thr Ala Pro His Gly Thr Ala 
1 5 10 15 

gca ggc egg gaa ggt tta gat get tta ctg gca act tec gca tta act 96 
Ala Gly Arg Glu Gly Leu Asp Ala Leu Leu Ala Thr Ser Ala Leu Thr 
20 25 30 

gac gat ctg get gtc ttc ttt ata get gat ggc gtt ttt cag ctg ctg 144 
Asp Asp Leu Ala Val Phe Phe He Ala Asp Gly Val Phe Gin Leu Leu 
35 40 45 

cca gga caa aag ccc gat gca gtg ctg gcg cgt gat tac att gec act 192 
Pro Gly Gin Lye Pro Asp Ala Val Leu Ala Arg Asp Tyr He Ala Thr 
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50 55 60 

ttt aaa ttg ttg ggt ctg tac gac att gaa cag tgc tgg gtt tgt gcg 240 
Phe Lys Leu Leu Gly Leu Tyr Asp lie Glu Gin Cys Trp Val Cys Ala 
65 70 75 80 

get tea ctg cgc gaa cgc ggg tta gat ccg cag aca ccc ttt gtt gtc 288 
Ala Ser Leu Arg Glu Arg Gly Leu Asp Pro Gin Tar Pro Phe Val Val 
85 90 95 

gaa gec acg ccg etc gaa gca gat gee tta cgc cgc gaa etc gec aac 336 
Glu Ala Thr Pro Leu Glu Ala Asp Ala Leu Arg Arg Glu Leu Ala Asn 
100 105 110 

tac gat gtt att ttg agg ttt tga 360 
Tyr Asp Val lie Leu Arg Phe * 
115 



<210> 121 
<211> 387 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (387) 

<400> 121 

atg cgt ttt gee ate gtg gtg acc ggg cca gca tac ggt acg caa cag 48 
Met Arg Phe Ala lie Val Val Thr Gly Pro Ala Tyr Gly Thr Gin Gin 
15 10 15 

gcg agt agt get ttt cag ttt gcg cag gcg ctg ata gca gat ggc cat 96 
Ala Ser Ser Ala Phe Gin Phe Ala Gin Ala Leu lie Ala Asp Gly His 
20 25 30 

gag tta age age gtc ttt ttc tat egg gaa ggg gtc tat aac get aac 144 
Glu Leu Ser Ser Val Phe Phe Tyr Arg Glu Gly Val Tyr Asn Ala Asn 
35 40 45 

caa ttg acc tct ccg gca agt gac gaa ttt gac etc gta egg gec tgg 192 
Gin Leu Thr Ser Pro Ala Ser Asp Glu Phe Asp Leu Val Arg Ala Trp 
50 55 60 

caa caa ctg aat gcg caa cat ggt gtg gcg ctg aat ate tgc gta gcg 240 
Gin Gin Leu Asn Ala Gin His Gly Val Ala Leu Asn lie Cys Val Ala 
65 70 75 80 

gca gca tta cgc cgt ggc gtt gtt gat gaa acg gag gec gga aga ctg 288 
Ala Ala Leu Arg Arg Gly Val Val Asp Glu Thr Glu Ala Gly Arg Leu 
85 90 95 

ggg ctg get teg tea aac ctt cag cag gga ttt acc tta age gga ctt 336 
Gly Leu Ala Ser Ser Asn Leu Gin Gin Gly Phe Thr Leu Ser Gly Leu 
100 105 110 

9SS ctg gcg gaa gee teg ctg acc tgt gac agg gtg gta cag ttc 384 
Gly Ala Leu Ala Glu Ala Ser Leu Thr Cys Asp Arg Val Val Gin Phe 
115 120 125 
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tga 387 
* 



<210> 122 








<211> 735 








<212> DNA 








<213> Escherichia coli 








<220> 








<221> CDS 








<222> (1) . . . (735) 








<400> 122 








gtg ttt ttt ttc atg tec agg teg 


ctt 


tta acc aae gaa ace 


agt 


Met Phe Phe Phe Met Ser Arg Ser 


Leu 


Leu Thr Asn Glu Thr 


Ser 


1 5 




10 


15 



48 



ttg gat tta ctg gat caa cgt cct ttc gac cag acc gat ttt gat att 96 
Leu Asp Leu Leu Asp Gin Arg Pro Phe Asp Gin Thr Asp Phe Asp lie 
20 25 30 

ctg aaa tec tac gaa gcg gtg gtg gac ggg tta gcg atg ctt att ggc 144 
Leu Lys Ser Tyr Glu Ala Val Val Asp Gly Leu Ala Met Leu lie Gly 
35 40 45 

tec cac tgt gaa ate gtt ttg cac tct ttg cag gat eta aaa tgt tea 192 
Ser His Cys Glu He Val Leu His Ser Leu Gin Asp Leu Lys Cys Ser 
50 55 60 

gee att cgc att get aae ggt gaa cat aca ggc egg aag att ggt teg 240 
Ala He Arg He Ala Asn Gly Glu His Thr Gly Arg Lys He Gly Ser 
65 70 75 80 

cca att act gac ctg gcg eta cgt atg ctg cac gat atg acg gga gcg 288 
Pro lie Thr Asp Leu Ala Leu Arg Met Leu His Asp Met Thr Gly Ala 
85 90 95 

gat age age gtt tct aaa tgc tac ttt act cgc gee aaa age ggc gta 336 
Asp Ser Ser Val Ser Lys Cys Tyr Phe Thr Arg Ala Lys Ser Gly Val 
100 105 110 

tta atg aag tec ctg act ate gcg att cgt aac cgc gaa cag cgt gta 384 
Leu Met Lys Ser Leu Thr He Ala lie Arg Asn Arg Glu Gin Arg Val 
115 120 125 

att ggt ctg ctg tgc ate aat atg aat ctt gat gtt ccc ttc teg cag 432 
He Gly Leu Leu Cys lie Asn Met Asn Leu Asp Val Pro Phe Ser Gin 
130 135 140 

att atg age acc ttt gtg ccg cca gaa acc ccg gat gtc ggt tea age 480 
He Met Ser Thr Phe Val Pro Pro Glu Thr Pro Asp Val Gly Ser Ser 
145 150 155 160 

gtc aac ttt gec tct tct gtt gaa gat ctg gtt acc caa acg ctg gag 528 
Val Asn Phe Ala Ser Ser Val Glu Asp Leu Val Thr Gin Thr Leu Glu 
165 " 170 175 

ttc acc ate gaa gaa gtg aat gee gat cgc aat gtt tct aat aac gee 576 
Phe Thr He Glu Glu Val Asn Ala Asp Arg Asn Val Ser Asn Asn Ala 
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180 185 190 

aaa aat cgt cag ate gtg ctg aat etc tac gag aaa ggg ate ttc gat 624 
Lys Asn Arg Gin He Val Leu Asn Leu Tyr Glu Lys Gly He Phe Asp 
195 200 205 

att aaa gat gcg ate aac cag gtt get gac cgc ctg aac ate tec aaa 672 
He Lye Asp Ala He Asn Gin Val Ala Asp Arg Leu Asn He Ser Lys 
210 215 220 

cac act gtc tat etc tac ate cgc cag ttc aag age ggt gat ttc cag 720 
His Thr Val Tyr Leu Tyr He Arg Gin Phe Lys Ser Gly Asp Phe Gin 
225 230 235 240 

ggg caa gat aag taa 735 
Gly Gin Asp Lys * 



<210> 123 
<211> 255 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (255) 

<400> 123 

atg acc gat aaa ate cgt act ctg caa ggt cgc gtt gtt age gac aaa 48 
Met Thr Asp Lys He Arg Thr Leu Gin Gly Arg Val Val Ser Asp Lys 
1 5 10 15 

atg gag aaa tec att gtt gtt get ate gaa cgt ttt gtg aaa cac ccg 96 
Met Glu Lys Ser He Val Val Ala He Glu Arg Phe Val Lys His Pro 
20 25 30 

ate tac ggt aaa ttc ate aag cgt acg acc aaa ctg cac gta cat gac 144 
He Tyr Gly Lys Phe He Lys Arg Thr Thr Lys Leu His Val His Asp 
35 40 45 

gag aac aac gaa tgc ggt ate ggt gac gtg gtt gaa ate cgc gaa tgc 192 
Glu Asn Asn Glu Cys Gly He Gly Asp Val Val Glu He Arg Glu Cys 
50 55 60 

cgt ccg ctg tec aag act aaa tec tgg acg ctg gtt cgc gtt gta gag 240 
Arg Pro Leu Ser Lys Thr Lys Ser Trp Thr Leu Val Arg Val Val Glu 
65 70 75 80 

aaa gcg gtt ctg taa 255 
Lys Ala Val Leu * 



<210> 124 
<211> 192 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
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<222> (1) . . . (192) 
<400> 124 

atg aaa gca aaa gag ctg cgt gag aag age gtt gaa gag ctg aac acc 48 
Met Lys Ala Lys Glu Leu Arg Glu Lys Ser Val Glu Glu Leu Asn Thr 
15 10 15 

gag ctg ctg aac ctg ctg cgt gag cag ttc aac ctg cgt atg cag get 96 
Glu Leu Leu Asn Leu Leu Arg Glu Gin Phe Asn Leu Arg Met Gin Ala 
20 25 30 

gca agt ggc cag ctg caa cag tct cac ctg ttg aag caa gtg cgt cgc 144 
Ala Ser Gly Gin Leu Gin Gin Ser His Leu Leu Lys Gin Val Arg Arg 
35 40 45 

gat gtc gca cgc gtt aag act tta ctg aac gag aag gcg ggt gcg taa 192 
Asp Val Ala Arg Val Lys Thr Leu Leu Asn Glu Lys Ala Gly Ala * 
50 55 60 



/ 

<210> 125 
<211> 411 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (411) 

<400> 125 

atg tta caa cca aag cgt aca aaa ttc cgt aaa atg cac aaa ggc cgt 48 

Met Leu Gin Pro Lys Arg Thr Lys Phe Arg Lys Met His Lys Gly Arg 
1 5 10 15 

aac cgc ggt ctg gcg cag ggt acg gat gtt age ttc ggc age ttc ggt 96 
Asn Arg Gly Leu Ala Gin Gly Thr Asp Val Ser Phe Gly Ser Phe Gly 
20 25 30 

ctg aaa get gtt ggc cgt ggt cgt ctg act gee cgt cag ate gaa gca 144 
Leu Lys Ala Val Gly Arg Gly Arg Leu Thr Ala Arg Gin lie Glu Ala 
35 40 45 

gca cgt cgt get atg acc cgt gca gtt aag cgt caa ggt aag ate tgg 192 
Ala Arg Arg Ala Met Thr Arg Ala Val Lys Arg Gin Gly Lys lie Trp 
50 55 60 

ate cgt gtg ttc ccg gac aaa ccg ate act gaa aag ccg ctg gca gtg 240 
lie Arg Val Phe Pro Asp Lys Pro He Thr Glu Lys Pro Leu Ala Val 
65 70 75 80 

cgt atg ggt aaa ggt aaa ggt aac gtg gag tat tgg gtt gec ttg att 288 
Arg Met Gly Lys Gly Lys Gly Asn Val Glu Tyr Trp Val Ala Leu He 
85 90 95 

cag ccg ggt aaa gtc ctg tat gaa atg gac ggt gtt ccg gaa gag ctg 336 
Gin Pro Gly Lys Val Leu Tyr Glu Met Asp Gly Val Pro Glu Glu Leu 
100 105 110 

gee cgt gaa gca ttc aag ctg gca gca gcg aaa ctg ccg att aaa acc 384 
Ala Arg Glu Ala Phe Lys Leu Ala Ala Ala Lys Leu Pro He Lys Thr 
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115 120 125 

acc ttt gta act aag acg gtg atg taa 411 
Thr Phe Val Thr Lys Thr Val Met * 
130 135 



<210> 126 
<211> 702 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (702) 

<400> 126 

atg ggt cag aaa gta cat cct aat ggt att cgc ctg ggt att gta aaa 48 
Met Gly Gin Lys Val His Pro Asn Gly lie Arg Leu Gly lie Val Lys 
1 5 10 15 

cca tgg aac tct acc tgg ttt gcg aac acc aaa gaa ttc get gac aac 96 
Pro Trp Asn Ser Thr Trp Phe Ala Asn Thr Lys Glu Phe Ala Asp Asn 
20 25 30 

ctg gac age gat ttt aaa gta cgt cag tac ctg act aag gaa ctg get 144 
Leu Asp Ser Asp Phe Lys Val Arg Gin Tyr Leu Thr Lys Glu Leu Ala 
35 40 45 

aaa gcg tec gta tct cgt ate gtt ate gag cgt ccg get aag age ate 192 
Lys Ala Ser Val Ser Arg He Val He Glu Arg Pro Ala Lys Ser He 
50 55 60 

cgt gta aec att cac act get cgc ccg ggt ate gtt ate ggt aaa aaa 240 
Arg Val Thr He His Thr Ala Arg Pro Gly He Val He Gly Lys Lys 
65 70 75 80 

ggt gaa gac gta gaa aaa ctg cgt aag gtc gta gcg gac ate get ggc 288 
Gly Glu Asp Val Glu Lys Leu Arg Lys Val Val Ala Asp He Ala Gly 
85 90 95 

gtt cct gca cag ate aac ate gee gaa gtt cgt aag cct gaa ctg gac 336 
Val Pro Ala Gin He Asn He Ala Glu Val Arg Lys Pro Glu Leu Asp 
100 105 110 

gca aaa ctg gtt get gac age ate act tct cag ctg gaa cgt cgc gtt 384 
Ala Lys Leu Val Ala Asp Ser He Thr Ser Gin Leu Glu Arg Arg Val 
115 120 125 

atg ttc cgt cgt get atg aag cgt get gta cag aac gca atg cgt ctg 432 
Met Phe Arg Arg Ala Met Lys Arg Ala Val Gin Asn Ala Met Arg Leu 
130 135 140 

ggc get aaa ggt att aaa gtt gaa gtt age ggc cgt ctg ggc ggc gcg 480 
Gly Ala Lys Gly He Lys Val Glu Val Ser Gly Arg Leu Gly Gly Ala 
145 150 155 " 160 

gaa ate gca cgt acc gaa tgg tac cgc gaa ggt cgc gta ccg ctg cac 528 
Glu He Ala Arg Thr Glu Trp Tyr Arg Glu Gly Arg Val Pro Leu His 
165 170 175 

\ 
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act ctg cgt get gac ate gac tac aac ace tct gaa gcg cac ace act 576 
Thr Leu Arg Ala Asp lie Asp Tyr Asn Thr Ser Glu Ala His Thr Thr 
180 185 190 

tac ggt gta ate ggc gtt aaa gtg tgg ate ttc aaa ggc gag ate ctg 624 
Tyr Gly Val lie Gly Vafl Lys Val Trp He Phe Lys Gly Glu He Leu 
195 200 205 

OTt ggt atg get get gtt gaa caa ccg gaa aaa ccg get get cag cct 672 
Gly Gly Met Ala Ala Val Glu Gin Pro Glu Lys Pro Ala Ala Gin Pro 
210 215 220 

aaa aag cag cag cgt aaa ggc cgt aaa taa 702 
Lys Lys Gin Gin Arg Lys Gly Arg Lys * 
225 230 



<210> 127 
<211> 333 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (333) 

<400> 127 

atg gaa act ate get aaa cat cgc cat get cgt tct tct get cag aag 48 
Met Glu Thr He Ala Lys His Arg His Ala Arg Ser Ser Ala Gin Lys 
1 5 10 15 

gtt cgc ctt gtt get gac ctg att cgc ggt aag aaa gtg teg cag get 96 
Val Arg Leu Val Ala Asp Leu He Arg Gly Lys Lys Val Ser Gin Ala 
20 25 30 

ct ST 9 at at t ttg acc tac acc aac aa 9 aaa STcg get gta ctg gtc aag 144 
Leu Asp He Leu Thr Tyr Thr Asn Lys Lys Ala Ala Val Leu Val Lys 
35 40 45 

aaa gtt ctg gaa tct gec att get aac get gaa cac aac gat ggc get 192 
Lys Val Leu Glu Ser Ala He Ala Asn Ala Glu His Asn Asp Gly Ala 
50 55 60 

gac att gac gat ctg aaa gtt acg aaa att ttc gta gac gaa ggc ccg 240 
Asp He Asp Asp Leu Lys Val Thr Lys He Phe Val Asp Glu Gly Pro 
65 70 75 80 

age atg aag cgc att atg ccg cgt gca aaa ggt cgt gca gat cgc ate 288 
Ser Met Lys Arg He Met Pro Arg Ala Lys Gly Arg Ala Asp Arg He 
85 90 95 

ctg aag cgc acc age cac ate act gtg gtt gtg tec gat cgc tga 333 
Leu Lys Arg Thr Ser His He Thr Val Val Val Ser Asp Arg * 
100 105 110 



<210> 128 
<211> 279 
<212> DNA 

<213> Escherichia coli 



-61- 



WO 01/48209 



PCT/US00/34419 



<220> 

<22l> CDS 

<222> (1) . . . (279) 

<400> 128 

atg cca cgt tct etc aag aaa ggt cct ttt att gac ctg cac ttg ctg 48 
Met Pro Arg Ser Leu Lys Lys Gly Pro Phe lie Asp Leu His Leu Leu 
15 10 15 

aag aag gta gag aaa gcg gtg gaa age gga gac aag aag ccc ctg cgc 96 
Lys Lys Val Glu Lys Ala Val Glu Ser Gly Asp Lys Lys Pro Leu Arg 
20 25 30 

act tgg tec cgt cgt tea acg ate ttt cct aac atg ate ggt ttg ace 144 
Thr Trp Ser Arg Arg Ser Thr lie Phe Pro Asn Met lie Gly Leu Thr 
35 40 45 

ate get gtc cat aat ggt cgt cag cac gtt ccg gta ttt gta acc gac 192 
lie Ala Val His Asn Gly Arg Gin His Val Pro Val Phe Val Thr Asp 
50 55 60 

gaa atg gtt ggt cac aaa ctg ggt gaa ttc gca ccg act cgt act tat 240 
Glu Met Val Gly His Lys Leu Gly Glu Phe Ala Pro Thr Arg Thr Tyr 
65 70 75 80 

cgc ggc cac get get gat aaa aaa gcg aag aag aaa taa 279 
Arg Gly His Ala Ala Asp Lys Lys Ala Lys Lys Lys * 
85 90 



<210> 129 
<211> 822 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (822) 

<400> 129 

atg gca gtt gtt aaa tgt aaa ccg aca tct ccg ggt cgt cgc cac gta 48 
Met Ala Val Val Lys Cys Lys Pro Thr Ser Pro Gly Arg Arg His Val 
15 10 15 

gtt aaa gtg gtt aac cct gag ctg cac aag ggc aaa cct ttt get ccg 96 
Val Lys Val Val Asn Pro Glu Leu His Lys Gly Lys Pro Phe Ala Pro 
20 25 30 

ttg ctg gaa aaa aac age aaa tec ggt ggt cgt aac aac aat ggc cgt 144 
Leu Leu Glu Lys Asn Ser Lys Ser Gly Gly Arg Asn Asn Asn Gly Arg 
35 40 45 

ate acc act cgt cat ate ggt ggt ggc cac aag cag get tac cgt att 192 
lie Thr Thr Arg His lie Gly Gly Gly His Lys Gin Ala Tyr Arg He 
50 55 60 



gtt gac ttc aaa cgc aac aaa gac ggt ate ccg gca gtt gtt gaa cgt 240 
Val Asp Phe Lys Arg Asn Lys Asp Gly He Pro Ala Val Val Glu Arg 
65 . 70 75 80 
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ctt gag tac gat ccg aac cgt tec gcg aac ate gcg ctg gtt ctg tac 288 
Leu Glu Tyr Asp Pro Asn Arg Ser Ala Asn lie Ala Leu Val Leu Tyr 
85 90 95 

aaa gac ggt gaa cgc cgt tac ate ctg gee cct aaa ggc ctg aaa get 336 
Lys Asp Gly Glu Arg Arg Tyr lie Leu Ala Pro Lys Gly Leu Lys Ala 
100 105 110 

ggc gac cag att cag tct ggc gtt gat get gca ate aaa cca ggt aac 384 
Gly Asp Gin lie Gin Ser Gly Val Asp Ala Ala lie Lys Pro Gly Asn 
115 120 125 

acc ctg ccg atg cgc aac ate ccg gtt ggt tct act gtt cat aac gta 432 
Thr Leu Pro Met Arg Asn lie Pro Val Gly Ser Thr Val His Asn Val 
130 135 140 

gaa atg aaa cca ggt aaa ggc ggt cag ctg gca cgt tec get ggt act 480 
Glu Met Lys Pro Gly Lys Gly Gly Gin Leu Ala Arg Ser Ala Gly Thr 
145 150 155 160 

tac gtt cag ate gtt get cgt gat ggt get tat gtc acc ctg cgt ctg 528 
Tyr Val Gin lie Val Ala Arg Asp Gly Ala Tyr Val Thr Leu Arg Leu 
165 170 175 

cgt tct ggt gaa atg cgt aaa gta gaa gca gac tgc cgt gca act ctg 576 
Arg Ser Gly Glu Met Arg Lys Val Glu Ala Asp Cys Arg Ala Thr Leu 
180 185 190 

ggc gaa gtt ggc aat get gag cat atg ctg cgc gtt ctg ggt aaa gca 624 
Gly Glu Val Gly Asn Ala Glu His Met Leu Arg Val Leu Gly Lys Ala 
195 200 205 

ggt get gca cgc tgg cgt ggt gtt cgt ccg acc gtt cgc ggt acc gcg 672 
Gly Ala Ala Arg Trp Arg Gly Val Arg Pro Thr Val Arg Gly Thr Ala 
210 215 220 

atg aac ccg gta gac cac cca cat ggt ggt ggt gaa ggt cgt aac ttt 720 
Met Asn Pro Val Asp His Pro His Gly Gly Gly Glu Gly Arg Asn Phe 
225 230 235 240 

ggt aag cac ccg gta act ccg tgg ggc gtt cag acc aaa ggt aag aag 768 
Gly Lys His Pro Val Thr Pro Trp Gly Val Gin Thr Lys Gly Lys Lys 
245 250 255 

acc cgc age aac aag cgt act gat aaa ttc ate gta cgt cgc cgt age 816 
Thr Arg Ser Asn Lys Arg Thr Asp Lys Phe He Val Arg Arg Arg Ser 
260 265 270 

aaa taa 822 
Lys * 



<210> 130 
<211> 303 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (303) 
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<400> 130 

atg att cgt gaa gaa cgt ctg ctg aag gtg ctg cgt gca ccg cac gtt 48 
Met lie Arg Glu Glu Arg Leu Leu Lys Val Leu Arg Ala Pro His Val 
15 10 15 

tct gaa aaa gcg tct act gcg atg gaa aaa tec aac acc ate gta etc 96 
Ser Glu Lys Ala Ser Thr Ala Met Glu Lys Ser Asn Thr lie Val Leu 
20 25 30 

aaa gtt get aaa gac gcg acc aaa gca gaa ate aaa get get gtg cag 144 
Lys Val Ala Lys Asp Ala Thr Lys Ala Glu lie Lys Ala Ala Val Gin 
35 40 45 

aaa ctg ttt gaa gtc gaa gtc gaa gtc gtt aac acc ctg gta gtt aaa 192 
Lys Leu Phe Glu Val Glu Val Glu Val Val Asn Thr Leu Val Val Lys 
50 55 60 

ggg aaa gtt aaa cgt cac gga cag cgt ate ggt cgt cgt age gac tgg 240 
Gly Lys Val Lys Arg His Gly Gin Arg lie Gly Arg Arg Ser Asp Trp 
65 70 75 80 

aaa aaa get tac gtc acc ctg aaa gaa ggc cag aat ctg gac ttc gtt 288 
Lys Lys Ala Tyr Val Thr Leu Lys Glu Gly Gin Asn Leu Asp Phe Val 
85 90 95 

ggc ggc get gag taa 303 
Gly Gly Ala Glu * 
100 



<210> 131 
<211> 606 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . • . (606) 

<400> 131 

atg gaa tta gta ttg aaa gac gcg cag age gcg ctg act gtt tec gaa 48 
Met Glu Leu Val Leu Lys Asp Ala Gin Ser Ala Leu Thr Val Ser Glu 
15 10 15 

act acc ttc ggt cgt gat ttc aac gaa gcg ctg gtt cac cag gtt gtt 96 
Thr Thr Phe Gly Arg Asp Phe Asn Glu Ala Leu Val His Gin Val Val 
20 25 30 

gtt get tat gca get ggt get cgt cag ggt act cgt get cag aag act 144 
Val Ala Tyr Ala Ala Gly Ala Arg Gin Gly Thr Arg Ala Gin Lys Thr 
35 40 45 

cgt get gaa gta act ggt tec ggt aaa aaa ccg tgg cgc cag aaa ggc 192 
Arg Ala Glu Val Thr Gly Ser Gly Lys Lys Pro Trp Arg Gin Lys Gly 
50 55 60 

acc ggc cgt gcg cgt tct ggt tct ate aag age ccg ate tgg cgt tct 240 
Thr Gly Arg Ala Arg Ser Gly Ser lie Lys Ser Pro lie Trp Arg Ser 
65 70 75 80 
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ggt ggc gtg acc ttt get get cgt ccg cag gac cac agt caa aaa gtt 288 
Gly Gly Val Thr Phe Ala Ala Arg Pro Gin Asp His Ser Gin Lys Val 
85 90 95 

aac aag aag atg tac cgc ggc gcg ctg aaa age ate ctg tec gaa ctg 336 
Asn Lys Lys Met Tyr Arg Gly Ala Leu Lys Ser lie Leu Ser Glu Lieu 
100 105 110 

gta cgt cag gat cgt ctg ate gtt gtc gag aag ttc tct gta gaa gcg 384 
Val Arg Gin Asp Arg Leu lie Val Val Glu Lys Phe Ser Val Glu Ala 
115 120 125 

ccg aaa act aag ctg ctg gca cag aaa ctg aaa gac atg get ctg gaa 432 
Pro Lys Thr Lys Leu Leu Ala Gin Lys Leu Lys Asp Met Ala Leu Glu 
130 135 140 

gat gtg ctg ate ate acc ggt gag ctg gac gaa aac ctg ttc ctg get 480 
Asp Val Leu He He Thr Gly Glu Leu Asp Glu Asn Leu Phe Leu Ala 
145 150 155 160 



gcg cgc aac ctg cac aag gtt gac gta cgc gat gca act ggt ate gac 
Ala Arg Asn Leu His Lys Val Asp Val Arg Asp Ala Thr Gly He Asp 
165 170 175 



528 



ccg gtt age ctg ate gee ttc gac aaa gtc gta atg act get gat get 576 
Pro Val Ser Leu He Ala Phe Asp Lys Val Val Met Thr Ala Asp Ala 
180 185 190 

gtt aag caa gtt gag gag atg ctg gca tga 606 
Val Lys Gin Val Glu Glu Met Leu Ala * 
195 200 



<210> 132 
<211> 630 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (630) 

<400> 132 

atg att ggt tta gtc ggt aaa aaa gtg ggt atg acc cgt ate ttc aca 48 
Met He Gly Leu Val Gly Lys Lys Val Gly Met Thr Arg He Phe Thr 
1 5 10 15 

gaa gac ggc gtt tct ate cca gta acc gta ate gaa gtt gaa gca aac 96 
Glu Asp Gly Val Ser He Pro Val Thr Val He Glu Val Glu Ala Asn 
20 25 30 

cgc gtt act cag gtt aaa gac ctg get aac gat ggc tac cgt get att 144 
Arg Val Thr Gin Val Lys Asp Leu Ala Asn Asp Gly Tyr Arg Ala He 
35 40 45 

cag gtg acc acc ggt get aaa aaa get aac cgt gtg acc aag cct gaa 192 
Gin Val Thr Thr Gly Ala Lys Lys Ala Asn Arg Val Thr Lys Pro Glu 
50 55 60 

get ggc cac ttc get aaa get ggc gta gaa get ggc cgt ggt ctg tgg 240 
Ala Gly His Phe Ala Lys Ala Gly Val Glu Ala Gly Arg Gly Leu Trp 
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65 70 75 80 

gaa t'tc cgc ctg get gaa ggc gaa gag ttc act gta ggt cag age att 288 
Glu Phe Arg Leu Ala Glu Gly Glu Glu Phe Thr Val Gly Gin Ser lie 
85 90 95 

age gtt gaa ctg ttt get gac gtt aaa aaa gtt gac gta act ggc acc 336 
Ser Val Glu Leu Phe Ala Asp Val Lys Lys Val Asp Val Thr Gly Thr 
100 105 110 

tct aaa ggt aaa ggt ttc gca ggt acc gtt aag cgc tgg aac ttc cgt 384 
Ser Lys Gly Lys Gly Phe Ala Gly Thr Val Lys Arg Trp Asn Phe Arg 
115 120 125 

acc cag gac get act cac ggt aac tec ttg tct cac cgc gtt ccg ggt 432 
Thr Gin Asp Ala Thr His Gly Asn Ser Leu Ser His Arg Val Pro Gly 
130 135 140 

tct ate ggt cag aac cag act ccg ggc aaa gtg ttc aaa ggc aag aaa 480 
Ser lie Gly Gin Asn Gin Thr Pro Gly Lys Val Phe Lys Gly Lys Lys 
145 150 155 160 

atg gca ggt cag atg ggt aac gaa cgt gta acc gtt cag age ctt gac 528 
Met Ala Gly Gin Met Gly Asn Glu Arg Val Thr Val Gin Ser Leu Asp 
165 170 175 

gta gta cgc gtt gac get gag cgc aac ctg ctg ctg gtt aaa ggt get 576 
Val Val Arg Val Asp Ala Glu Arg Asn Leu Leu Leu val Lys Gly Ala 
180 185 190 

gtc ccg ggt gca acc ggt age gac ctg ate gtt aaa cca get gtg aag 624 
Val Pro Gly Ala Thr Gly Ser Asp Leu lie Val Lys Pro Ala Val Lys 
195 200 205 

gcg taa 63 0 

Ala * 



<210> 133 
<211> 312 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (312) 

<400> 133 

atg cag aac caa aga ate cgt ate 
Met Gin Asn Gin Arg He Arg He 
1 5 

ctg ate gat caa gca acc gcg gaa 
Leu He Asp Gin Ala Thr Ala Glu 
20 

ggt gcg cag gtc cgt ggt ccg ate 
Gly Ala Gin Val Arg Gly Pro He 
35 40 



cgc ctg aaa gcg ttt gat cat cgt 48 
Arg Leu Lys Ala Phe Asp His Arg 
10 15 

ate gtc gag act gee aag cgc act 96 
He Val Glu Thr Ala Lys Arg Thr 
25 30 

ccg ctg ccg aca cgc aaa gag cgc 144 
Pro Leu Pro Thr Arg Lys Glu Arg 
45 
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ttc act gtt ctg ate tec ccg cac gtc aac aaa gac gcg cgc gat cag 192 
Phe Thr Val Leu lie Ser Pro His Val Asn Lys Asp Ala Arg Asp Gin 
50 55 60 

tac gaa ate cgt act cac ttg cgt ctg gtt gac ate gtt gag cca acc 240 
Tyr Glu lie Arg Thr His Leu Arg Leu Val Asp He Val Glu Pro Thr 
65 70 75 80 

gag aaa acc gtt gat get ctg atg cgt ctg gat ctg get gee ggt gta 288 
Glu Lys Thr Val Asp Ala Leu Met Arg Leu Asp Leu Ala Ala Gly Val 
85 90 95 

gac gtg cag ate age ctg ggt taa 312 
Asp Val Gin He Ser Leu Gly * 
100 



<210> 134 








<211> 315 








<212> DNA 








<213> Escherichia coli 








<220>' 








<221> CDS 








<222> (1) . . . (315) 








<400> 134 








atg ate cgc aaa gee ttt 


gtc 


atg cag gta aac 


ccc gac gec cac 


Met lie Arg Lys Ala Phe 


Val 


Met Gin Val Asn 


Pro Asp Ala His 


1 " 5 




10 


15 



48 



gag tat cag cgt egg cat aat ccc ate tgg cca gaa ctg gaa gca gtg 96 
Glu Tyr Gin Arg Arg His Asn Pro lie Trp Pro Glu Leu Glu Ala Val 
20 25 30 

ctg aaa tct cac ggt gcg cat aac tac gee ate tat etc gac aaa gcg 144 
Leu Lys Ser His Gly Ala His Asn Tyr Ala He Tyr Leu Asp Lys Ala 
35 40 45 

cgt aat ctg ctg ttt gec atg gta gag att gaa tct gaa gaa cgc tgg 192 
Arg Asn Leu Leu Phe Ala Met Val Glu He Glu Ser Glu Glu Arg Trp 
50 55 60 

aat gcg gtt gee age act gat gtt tgc caa cgt tgg tgg aaa tat atg 240 
Asn Ala Val Ala Ser Thr Asp Val Cya Gin Arg Trp Trp Lys Tyr Met 
65 70 75 B0 

acc gat gtt atg ccc get aac ccg gat aac age ccg gtg agt age gag 288 
Thr Asp Val Met Pro Ala Asn Pro Asp Asn Ser Pro Val Ser Ser Glu 
85 90 95 

ctg caa gaa gtg ttt tac ctg ccg taa 315 
Leu Gin Glu Val Phe Tyr Leu Pro * 
100 



<210> 135 
<211> 864 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> CDS 

<222> (1) . . . (864) 

<400> 135 

atg ate cgc agt atg acc gec tac gec egg cgt gaa ate aag ggt gaa 48 
Met He Arg Ser Met Thr Ala Tyr Ala Arg Arg Glu He Lys Gly Glu 
15 10 15 

tgg ggg age gca acc tgg gaa atg cgc teg gta aac cag cgt tat ctg 96 
Trp Gly Ser Ala Thr Trp Glu Met Arg Ser Val Asn Gin Arg Tyr Leu 
20 25 30 

gaa act tac ttt cgt ctg ccg gag cag ttc cgt age ctt gaa cct gtc 144 
Glu Thr Tyr Phe Arg Leu Pro Glu Gin Phe Arg Ser Leu Glu Pro Val 
35 40 45 

gtt cgc gag cgt att cgt tct cgc ctg acg cgc ggt aaa gtg gaa tgt 192 
Val Arg Glu Arg He Arg Ser Arg Leu Thr Arg Gly Lys Val Glu Cys 
50 55 60 

acc ctg cgc tat gag cca gat gtt age gcg caa ggt gag ctg ate etc 240 
Thr Leu Arg Tyr Glu Pro Asp Val Ser Ala Gin Gly Glu Leu He Leu 
65 70 75 80 

aac gaa aaa ctg get aaa cag ctg gta act gee gcg aac tgg gta aaa 288 
Asn Glu Lys Leu Ala Lys Gin Leu Val Thr Ala Ala Asn Trp Val Lys 
85 90 95 

atg cag agt gac gaa ggg gaa ate aac ccg gtt gat att eta cgc tgg 336 
Met Gin Ser Asp Glu Gly Glu He Asn Pro Val Asp He Leu Arg Trp 
100 105 110 

ccg ggc gtg atg gca gee cag gag cag gat ctt gac gec att gee get 384 
Pro Gly Val Met Ala Ala Gin Glu Gin Asp Leu Asp Ala He Ala Ala 
115 120 125 

gaa att etc gcg gcg ctg gat ggt acg ctg gac gac ttt att gtc gcg 432 
Glu He Leu Ala Ala Leu Asp Gly Thr Leu Asp Asp Phe He Val Ala 
130 135 140 

cgc gaa acc gaa ggt cag gca ctg aaa gca ttg ate gag cag cgt ctg 480 
Arg Glu Thr Glu Gly Gin Ala Leu Lys Ala Leu He Glu Gin Arg Leu 
145 150 155 160 

gaa ggc gtc acc gec gaa gtg gtc aaa gtc cgc tec cat atg ccg gaa 528 
Glu Gly Val Thr Ala Glu Val Val Lys Val Arg Ser His Met Pro Glu 
165 170 175 

ate ctg caa tgg cag cgt gag cgt ctg gtc gcg aag ctg gaa gat get 576 
He Leu Gin Trp Gin Arg Glu Arg Leu Val Ala Lys Leu Glu Asp Ala 
180 185 190 

cag gtg caa ctg gaa aac aac cgt ctg gag cag gaa ctg gtt ctg ctg 624 
Gin Val Gin Leu Glu Asn Asn Arg Leu Glu Gin Glu Leu Val Leu Leu 
195 200 205 

gca caa cga att gac gtt gec gaa gaa ctg gat cgc etc gaa gcg cat 672 
Ala Gin Arg He Asp Val Ala Glu Glu Leu Asp Arg Leu Glu Ala His 
210 215 220 
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gtc aaa gag acc tac aac att ctg aag aaa aaa gaa gcg gtt ggt cgt 720 
Val Lys Glu Thr Tyr Asn lie Leu Lys Lys Lys Glu Ala Val Gly Arg 
225 230 235 240 

cgt ctg gat ttt atg atg cag gag ttc aac cgc gag teg aac act ctt 768 
Arg Leu Asp Phe Met Met Gin Glu Phe Asn Arg Glu Ser Asn Thr Leu 
245 250 255 

gcg teg aag tct ate aat gec gaa gtg aca aac tec gec ate gag ctg 816 
Ala Ser Lys Ser lie Asn Ala Glu Val Thr Asn Ser Ala lie Glu Leu 
260 265 270 

aaa gtg ttg att gag cag atg cgc gag cag att cag aac ate gaa taa 864 
Lys Val Leu He Glu Gin Met Arg Glu Gin He Gin Asn He Glu * 
275 280 285 



<210> 136 
<211> 492 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (492) 

<400> 136 

atg age aca gaa aca att gaa ata ttc aat aat agt gat gaa tgg gca 48 

Met Ser Thr Glu Thr lie Glu He Phe Asn Asn Ser Asp Glu Trp Ala 
1 5 10 15 

aat caa eta aaa cac gca tta teg aaa gga gaa aat ctg gca tta eta 96 
Asn Gin Leu Lys His Ala Leu Ser Lys Gly Glu Asn Leu Ala Leu Leu 
20 25 30 

cat ggt tta act cct gat ate ctt gat aga ata tat gca tat gca ttc 144 
His Gly Leu Thr Pro Asp He Leu Asp Arg He Tyr Ala Tyr Ala Phe 
35 40 45 

gac tac cat gaa aag ggt aat ata acg gac gca gaa att tat tat aaa 192 
Asp Tyr His Glu Lys Gly Asn He Thr Asp Ala Glu He Tyr Tyr Lys 
50 55 60 

ttc ttg tgc att tat gcg ttc gaa aat cat gag tat eta aaa gat ttt 240 
Phe Leu Cys He Tyr Ala Phe Glu Asn His Glu Tyr Leu Lys Asp Phe 
65 70 75 80 

gca tea gta tgt cag ccc aaa aag aaa tat caa caa gca tat gac ctt 288 
Ala Ser Val Cys Gin Pro Lys Lys Lys Tyr Gin Gin Ala Tyr Asp Leu 
85 90 95 

tac aaa eta agt tac aat tac ttc ccg tat gat gac tat tea gtt att 336 
Tyr Lys Leu Ser Tyr Asn Tyr Phe Pro Tyr Asp Asp Tyr Ser Val He 
100 105 110 

tat cgt atg ggt caa tgt cag att ggg get aaa aat ate gat aac gca 384 
Tyr Arg Met Gly Gin Cys Gin He Gly Ala Lys Asn He Asp Asn Ala 
115 120 125 

atg caa tgt ttc tat cac att att aac aat tgt gag gat gat agt gtt 432 
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Met Gin Cys Phe Tyr His lie lie Asn Asn Cys Glu Asp Asp Ser Val 
130 135 140 

aag agt aaa gcg cag gca tat att gaa etc tta aac gat aat tea gaa 480 
Lys Ser Lys Ala Gin Ala Tyr lie Glu Leu lieu Asn Asp Asn Ser Glu 
145 150 155 160 

gat aat ggc taa 492 
Asp Asn Gly * 



<210> 137 
<211> 1947 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . - - (1947) 

<400> 137 

atg aat att tta ggt ttt ttc cag cga etc ggt agg gcg tta cag etc 48 
Met Asn He Leu Gly Phe Phe Gin Arg Leu Gly Arg Ala Leu Gin Leu 
15 10 15 

cct ate gcg gtg ctg ccg gtg gcg gca ctg ttg ctg cga ttc ggt cag 96 
Pro He Ala Val Leu Pro Val Ala Ala Leu Leu Leu Arg Phe Gly Gin 
20 25 30 

cca gat tta ctt aac gtt gcg ttt att gec cag gcg ggc ggt gcg att 144 
Pro Asp Leu Leu Asn Val Ala Phe He Ala Gin Ala Gly Gly Ala He 
35 40 45 

ttt gat aac etc gca tta ate ttc gee ate ggt gtg gca tec age tgg 192 
Phe Asp Asn Leu Ala Leu He Phe Ala He Gly Val Ala Ser Ser Trp 
50 55 60 

teg aaa gac age get ggt gcg gcg gcg ctg gcg ggt gcg gta ggt tac 240 
Ser Lys Asp Ser Ala Gly Ala Ala Ala Leu Ala Gly Ala Val Gly Tyr 
65 70 75 80 

ttt gtg tta acc aaa gcg atg gtg ace ate aac cca gaa att aac atg 288 
Phe Val Leu Thr Lys Ala Met Val Thr He Asn Pro Glu He Asn Met 
85 90 95 

ggt gta ctg gcg ggt ate att acc ggt ctg gtt ggt ggc gca gee tat 336 
Gly Val Leu Ala Gly He He Thr Gly Leu Val Gly Gly Ala Ala Tyr 
100 105 110 

aac cgt tgg tec gat att aaa ctg ccg gac ttc ctg age ttc ttc ggc 384 
Asn Arg Trp Ser Asp He Lys Leu Pro Asp Phe Leu Ser Phe Phe Gly 
115 120 125 

ggc aaa cgc ttt gtg ccg att gee acc gga ttc ttc tgc ctg gtg ctg 432 
Gly Lys Arg Phe Val Pro He Ala Thr Gly Phe Phe Cys Leu Val Leu 
130 135 140 

gcg gee att ttt ggt tac gtc tgg ccg ccg gta cag cac get ate cat 480 
Ala Ala He Phe Gly Tyr Val Trp Pro Pro Val Gin His Ala He His 
145 150 155 160 
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gca ggc ggc gag tgg ate gtt tct gcg ggc gcg ctg ggt tec ggt ate 528 
Ala Gly Gly Glu Trp lie Val Ser Ala Gly Ala Leu Gly Ser Gly lie 
165 170 175 

ttt ggt ttc ate aac cgt ctg ctg ate cca acc ggt ctg cat cag gta 576 
Phe Gly Phe lie Asn Arg Leu Leu lie Pro Thr Gly Leu His Gin Val 
180 185 190 

ctg aac acc ate gec tgg ttc cag att ggt gaa ttc acc aac gcg gcg 624 
Leu Asn Thr He Ala Trp Phe Gin He Gly Glu Phe Thr Asn Ala Ala 
195 200 205 

ggt acg gtt ttc cac ggt gac att aac cgc ttc tat gee ggt gac ggc 672 
Gly Thr Val Phe His Gly Asp He Asn Arg Phe Tyr Ala Gly Asp Gly 
210 215 220 

acc gcg ggg atg ttc atg tec ggc ttc ttc ccg ate atg atg ttc ggt 720 
Thr Ala Gly Met Phe Met Ser Gly Phe Phe Pro He Met Met Phe Gly 
225 230 235 240 

ctg ccg ggt gcg gcg ctg gcg atg tac ttc gca gca ccg aaa gag cgt 768 
Leu Pro Gly Ala Ala Leu Ala Met Tyr Phe Ala Ala Pro Lys Glu Arg 
245 250 255 

cgt ccg atg gtt ggc ggt atg ctg ctt tct gtt get gtt act gcg ttc 816 
Arg Pro Met Val Gly Gly Met Leu Leu Ser Val Ala Val Thr Ala Phe 
260 265 270 

ctg acc ggt gtg act gag ccg ctg gaa ttc ctg ttc atg ttc ctt get 864 
Leu Thr Gly Val Thr Glu Pro Leu Glu Phe Leu Phe Met Phe lieu Ala 
275 280 285 

ccg ctg ctg tac etc ctg cac gca ctg ctg acc ggt ate age ctg ttt 912 
Pro Leu Leu Tyr Leu Leu His Ala Leu Leu Thr Gly He Ser Leu Phe 
290 295 300 

gtg gca acg ctg ctg ggt ate cac gcg ggc ttc tct ttc tct gcg ggg 960 
Val Ala Thr Leu Leu Gly He His Ala Gly Phe Ser Phe Ser Ala Gly 
305 310 315 320 

get ate gac tac gcg ttg atg tat aac ctg ccg gee gee age cag aac 1008 
Ala He Asp Tyr Ala Leu Met Tyr Asn Leu Pro Ala Ala Ser Gin Asn 
325 330 335 

gtc tgg atg ctg ctg gtg atg ggc gtt ate ttc ttc get ate tac ttc 1056 
Val Trp Met Leu Leu Val Met Gly Val He Phe Phe Ala He Tyr Phe 
340 345 350 

gtg gtg ttc agt ttg gtt ate cgc atg ttc aac ctg aaa acg ccg ggt 1104 
Val Val Phe Ser Leu Val He Arg Met Phe Asn Leu Lys Thr Pro Gly 
355 360 365 

cgt gaa gat aaa gaa gac gag ate gtt act gaa gaa gee aac age aac 1152 
Arg Glu Asp Lys Glu Asp Glu He Val Thr Glu Glu Ala Asn Ser Asn 
370 375 380 

act gaa gaa ggt ctg act caa ctg gca acc aac tat att get gcg gtt 1200 
Thr Glu Glu Gly Leu Thr Gin Leu Ala Thr Asn Tyr He Ala Ala Val 
385 390 395 400 
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ggc ggc act gac aac ctg aaa gcg att gac gcc tgt ate acc cgt ctg 1248 
Gly Gly Thr Asp Asn Leu Lys Ala lie Asp Ala Cys lie Thr Arg Leu 
405 410 415 

cgc ctt aca gtg get gac tct gcc cgc gtt aac gat acg atg tgt aaa 1296 
Arg Leu Thr Val Ala Asp Ser Ala Arg Val Asn Asp Thr Met Cys Lys 
420 425 430 

cgt ctg ggt get tct ggg gta gtg aaa ctg aac aaa cag act att cag 1344 
Arg Leu Gly Ala Ser Gly Val Val Lys Leu Asn Lys Gin Thr lie. Gin 
435 440 445 

gtg att gtt ggc gcg aaa gca gaa tec ate ggc gat gcg atg aag aaa 1392 
Val lie Val Gly Ala Lys Ala Glu Ser He Gly Asp Ala Met Lys Lys 
450 455 460 

gtc gtt gcc cgt ggt ccg gta gcc get gcg tea get gaa gca act ccg 1440 
Val Val Ala Arg Gly Pro Val Ala Ala Ala Ser Ala Glu Ala Thr Pro 
465 470 475 480 

gca act gcc gcg cct gta gca aaa ccg cag get gta cca aac gcg gta 1488 
Ala Thr Ala Ala Pro Val Ala Lys Pro Gin Ala Val Pro Asn Ala Val 
485 490 495 

tct ate gcg gag ctg gta teg ccg att acc ggt gat gtc gtg gca ctg 1536 
Ser He Ala Glu Leu Val Ser Pro He Thr Gly Asp Val Val Ala Leu 
500 505 510 

gat cag gtt cct gac gaa gca ttc gcc age aaa gcg gtg ggt gac ggt 1584 
Asp Gin Val Pro Asp Glu Ala Phe Ala Ser Lys Ala Val Gly Asp Gly 
515 520 525 

gtg gcg gtg aaa ccg aca gat aaa ate gtc gta tea cca gcc gca ggg 1632 
Val Ala Val Lys Pro Thr Asp Lys He Val Val Ser Pro Ala Ala Gly 
530 535 540 

aca ate gtg aaa ate ttc aac acc aac cac gcg ttc tgc ctg gaa acc 1680 
Thr He Val Lys He Phe Asn Thr Asn His Ala Phe Cys Leu Glu Thr 
545 550 555 560 

gaa aaa ggc gcg gag ate gtc gtc cat atg ggt ate gac acc gta gcg 1728 
Glu Lys Gly Ala Glu He Val Val His Met Gly He Asp Thr Val Ala 
565 570 575 

ctg gaa ggt aaa ggc ttt aaa cgt ctg gtg gaa gag ggt gcg cag gta 1776 
Leu Glu Gly Lys Gly Phe Lys Arg Leu Val Glu Glu Gly Ala Gin Val 
580 585 590 

age gca ggg caa ccg att ctg gaa atg gat ctg gat tac ctg aac get 1824 
Ser Ala Gly Gin Pro He Leu Glu Met Asp Leu Asp Tyr Leu Asn Ala 
595 600 605 

aac gcc cgc teg atg att age ccg gtg gtt tgc age aat ate gac gat 1872 
Asn Ala Arg Ser Met He Ser Pro Val Val Cys Ser Asn He Asp Asp 
610 615 620 

ttc agt ggc ttg ate att aaa get cag ggc cat att gtg gcg ggt caa 1920 
Phe Ser Gly Leu He He Lys Ala Gin Gly His He Val Ala Gly Gin 
625 630 635 640 

aca ccg ctg tat gaa ate aaa aag taa 1947 
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Thr Pro Leu Tyr Glu lie Lys Lys * 
645 



<210> 138 
<211> 1239 
<212> DNA 

c213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1239) 

<400> 138 

atg aag act ate ttc agg tac att ctt ttt tta gca ctg tat tct tgt 48 
Met Lys Thr lie Phe Arg Tyr lie Leu Phe Leu Ala Leu Tyr Ser Cys 
15 10 15 

tgt aat aca gtc agt gca tat aca agt ttt att gtg gga aat aat gca 96 
Cys Asn Thr Val Ser Ala Tyr Thr Ser Phe lie Val Gly Asn Asn Ala 
20 25 30 

gga gtt gat aac tat cga ggc ccc tec act gec gca cag atg acc ttt 144 
Gly Val Asp Asn Tyr Arg Gly Pro Ser Thr Ala Ala Gin Met Thr Phe 
35 40 45 

aat tac aca tea aca gca age aac ttg gtt ttt tat aaa ccc acg cag 192 
Asn Tyr Thr Ser Thr Ala Ser Asn Leu Val Phe Tyr Lys Pro Thr Gin 
50 55 60 

etc ggc ccg act ggg gta aaa atg tac tgg tea tac ctg gat aca ggt 240 
Leu Gly Pro Thr Gly Val Lys Met Tyr Trp Ser Tyr Leu Asp Thr Gly 
65 70 75 80 

acc ggt ggt ggt att ctt tac tgc aat aca tct ggc aga gcg aat cct 288 
Thr Gly Gly Gly lie Leu Tyr Cys Asn Thr Ser Gly Arg Ala Asn Pro 
85 90 95 

ggt cca ata act att gaa aat gee atg gtc tat tea ggt aaa gat tat 336 
Gly Pro lie Thr lie Glu Asn Ala Met Val Tyr Ser Gly Lys Asp Tyr 
100 105 110 

99 c S9 a cat aaa cta ttt: aat aca tct SJtt cct ct S tat tac acc 384 

Gly Gly His Lys Leu Phe Asn Thr Ser Val Pro Gly Leu Tyr Tyr Thr 
115 120 125 

atg tta ata tea agg gtc tgg tct gca tac gat aca ata act gac att 432 
Met Leu lie Ser Arg Val Trp Ser Ala Tyr Asp Thr lie Thr Asp He 
130 135 140 

caa teg cca gga ate tat ate gga gat cct tec aac caa gaa ttt ttc 480 
Gin Ser Pro Gly He Tyr He Gly Asp Pro Ser Asn Gin Glu Phe Phe 
145 150 155 160 

ttt tec gtc aca gac age gat cta caa act aag ggt tgc aac aaa gca 528 
Phe Ser Val Thr Asp Ser Asp Leu Gin Thr Lys Gly Cys Asn Lys Ala 
165 170 175 

gac gac tac gat aag ttt tgg get att ggt ggt ata gta cac aac ata 576 
Asp Asp Tyr Asp Lys Phe Trp Ala He Gly Gly He Val His Asn He 
180 185 190 
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act gtt gaa ttt tat aca gat act aat ttc gat cct aca ctt aat cag 624 
Thr Val Glu Phe Tyr Thr Asp Thr Asn Phe Asp Pro Thr Leu Asn Gin 
195 200 205 

caa gtc cag tta tec agt tea tea aat tac ctt tat teg ttt aag gec 672 
Gin Val Gin Leu Ser Ser Ser Ser Asn Tyr Leu Tyr Ser Phe Lys Ala 
210 215 220 

tac agt cct ggt aca aaa gtt gta gat cac age aac cac att tat gtc 720 
Tyr Ser Pro Gly Thr Lys Val Val Asp His Ser Asn His He Tyr Val 
225 230 235 240 

aat ttc aca ctg aat aat gtc aaa tta acc tta cca aca tgt ttt acc 768 
Asn Phe Thr Leu Asn Asn Val Lys Leu Thr Leu Pro Thr Cys Phe Thr 
245 250 255 

tec ata ctt acc ggg cca tea gtc aat ggt tea acg gtt aga atg gga 816 
Ser He Leu Thr Gly Pro Ser Val Asn Gly Ser Thr Val Arg Met Gly 
260 265 270 

gaa tat age tct ggg aca att aaa aat ggc get tea cct gtt ccc ttt 864 
Glu Tyr Ser Ser Gly Thr He Lys Asn Gly Ala Ser Pro Val Pro Phe 
275 280 285 

gat ate teg ctt cag aat tgt att cgt gtg cgt aat att gaa aca aaa 912 
Asp He Ser Leu Gin Asn Cys He Arg Val Arg Asn He Glu Thr Lys 
290 295 300 

etc gtc act gga aaa gta ggt act caa aac aca caa ctg ctt ggt aat 960 
Leu Val Thr Gly Lys Val Gly Thr Gin Asn Thr Gin Leu Leu Gly Asn 
305 310 315 320 

acg ctt act gga age act gee gec aaa ggc gtt ggc gta etc att gaa 1008 
Thr Leu Thr Gly Ser Thr Ala Ala Lys Gly Val Gly Val Leu He Glu 
325 330 335 

ggt tta gca acc agt aaa aat ccg eta atg aca tta aaa cct aat gat 1056 
Gly Leu Ala Thr Ser Lys Asn Pro Leu Met Thr Leu Lys Pro Asn Asp 
340 345 350 

acg aat tct gtt tat ata gat tat gaa acc gaa gat gac acc tec gat 1104 
Thr Asn Ser Val Tyr He Asp Tyr Glu Thr Glu Asp Asp Thr Ser Asp 
355 360 365 

ggg gtt tac cct aat caa ggt aac ggc aca tea cag cct ctt cat ttc 1152 
Gly Val Tyr Pro Asn Gin Gly Asn Gly Thr Ser Gin Pro Leu His Phe 
370 375 380 

cag gca aca tta aag caa gac ggg aat att get ate gaa ccc ggc gaa 1200 
Gin Ala Thr Leu Lys Gin Asp Gly Asn He Ala He Glu Pro Gly Glu 
385 390 395 400 

ttt aaa gec acc agt act ttc cag gta acc tat ccc tga 1239 
Phe Lys Ala Thr Ser Thr Phe Gin Val Thr Tyr Pro * 
405 410 



<210> 139 
<211> 597 
<212> DNA 
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<213> Escherichia coli 

<220> 

<221> CDS 

<222> (I).,. (597) 

<400> 139 

atg cat ccc act caa cgt aag ctg atg aag aga ata att ctg ttt ctg 48 
Met His Pro Thr Gin Arg Lys Leu Met Lys Arg He He Leu Phe Leu 
1 5 10 15 

tea tta ctg ttt tgc ate gec tgt cca gec att get gga cag gat att 96 
Ser Leu Leu Phe Cys He Ala Cys Pro Ala He Ala Gly Gin Asp He 
20 25 30 

gac ctt gtt gec aat gta aaa aac age ace tgc aaa age gga ate agt 144 
Asp Leu Val Ala Asn Val Lys Asn Ser Thr Cys Lys Ser Gly He Ser 
35 40 45 

aac cag ggt aat att gat ctt ggc gtc gtt ggg gtg gga tat ttt tea 192 
Asn Gin Gly Asn He Asp Leu Gly Val Val Gly Val Gly Tyr Phe Ser 
50 55 60 

ggt aat gtt act cct gaa agt tat caa cca ggt gga aaa gag ttc act 240 
Gly Asn Val Thr Pro Glu Ser Tyr Gin Pro Gly Gly Lys Glu Phe Thr 
65 70 75 80 

ate act gta tec gac tgt gca tta cag gga act ggc gat gtg eta aat 288 
He Thr Val Ser Asp Cys Ala Leu Gin Gly Thr Gly Asp Val Leu Asn 
85 90 95 

cag tta cat att gat ttt aga gec ctt age ggt gtc atg get get ggc 336 
Gin Leu His He Asp Phe Arg Ala Leu Ser Gly Val Met Ala Ala Gly 
100 105 110 

tct agg caa ata ttt get aat gaa att teg tea gga gca agt aat gta 384 
Ser Arg Gin He Phe Ala Asn Glu lie Ser Ser Gly Ala Ser Asn Val 
115 120 125 

gga gta gtt ata ttt tct act cag gat teg gcg aat aca ttc aat gtt 432 
Gly Val Val He Phe Ser Thr Gin Asp Ser Ala Asn Thr Phe Asn Val 
130 135 140 

ctt aat get tea ggc gga tct cgt tec gtt tat cca gta atg teg gat 480 
Leu Asn Ala Ser Gly Gly Ser Arg Ser Val Tyr Pro Val Met Ser Asp 
145 150 155 160 

gac atg aat ggt tea tec tgg aaa ttt age acc cga atg caa aaa ate 528 
Asp Met Asn Gly Ser Ser Trp Lys Phe Ser Thr Arg Met Gin Lys He 
165 170 175 

gat cct gca ttg agt gtt aca tct ggt caa ctt atg age cat gtg tta 576 
Asp Pro Ala Leu Ser Val Thr Ser Gly Gin Leu Met Ser His Val Leu 
180 185 190 

gtg gat att tac tac gaa taa 597 
Val Asp He Tyr Tyr Glu * 
195 



<210> 140 
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<211> 606 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (606) 

<400> 140 

atg atg aca ttt aaa aat tta cgt tat gga tta tec age agt gtc gtt 48 
Met Met Thr Phe Lys Asn Leu Arg Tyr Gly Leu Ser Ser Ser Val Val 
15 10 15 

ttg get gee tea ttg ttc age gta etc tct tat gcg gca aca gac agt 96 
Leu Ala Ala Ser Leu Phe Ser Val Leu Ser Tyr Ala Ala Thr Asp Ser 
20 25 30 

a tt 99 a cfc g acc g tfc att acfc act 9 ta <J aa at £T gart act tgt acc get 144 
lie Gly Leu Thr Val He Thr Thr Val Glu Met Gly Thr Cys Thr Ala 
35 40 45 

aca tta gta aat gac tct gat cag gac att tct gtt gtt gat ttt ggt 192 
Thr Leu Val Asn Asp Ser Asp Gin Asp He Ser Val Val Asp Phe Gly 
50 55 60 

gat gta tat att tct gaa ate aat gee aag acc aaa gta aaa aca ttc 240 
Asp Val Tyr He Ser Glu He Asn Ala Lys Thr Lys Val Lys Thr Phe 
65 70 75 80 

aaa etc aaa ttc aaa gac tgt gcg ggt ate ccc aat aaa aaa gcg caa 2BB 
Lys Leu Lys Phe Lys Asp Cys Ala Gly He Pro Asn Lys Lys Ala Gin 
85 90 95 

ata aaa tta acc aag cga gee aca tgc gag gga act get aat gac ggt 336 
He Lys Leu Thr Lys Arg Ala Thr Cys Glu Gly Thr Ala Asn Asp Gly 
100 105 110 

gcg ggg ttt gca aat ggt tec aca gee gca gat aaa gca agt get gtc 384 
Ala Gly Phe Ala Asn Gly Ser Thr Ala Ala Asp Lys Ala Ser Ala Val 
115 120 125 

gee gtt gaa gtc tgg age act gta act ccg gca aca ggg agt gca aca 432 
Ala Val Glu Val Trp Ser Thr Val Thr Pro Ala Thr Gly Ser Ala Thr 
130 135 140 

caa ttt age tgt gta aca cca gca tea caa gag gta aca ate tec act 480 
Gin Phe Ser Cys Val Thr Pro Ala Ser Gin Glu Val Thr He Ser Thr 
145 150 155 160 

gca gee aat gcg gtc gtt tat tat ccg atg agt gca cgc ctg gtc gtg 528 
Ala Ala Asn Ala Val Val Tyr Tyr Pro Met Ser Ala Arg Leu Val Val 
165 170 175 

gaa aaa aat aaa acc gta aac aat gtc act gcg ggt aag ttt tct gca 576 
Glu Lys Asn Lys Thr Val Asn Asn Val Thr Ala Gly Lys Phe Ser Ala 
180 185 190 

cca gec aca ttt aca gta acc tat aac taa 606 
Pro Ala Thr Phe Thr Val Thr Tyr Asn * 
195 200 



-76- 



WO 01/48209 



PCT/US00/34419 



<210> 141 
<211> 612 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (612) 

<400> 141 

atg gaa ttc ggt gtc aga ttc agt aac tac aag gga cgt caa atg ata 48 
Met Glu Phe Gly Val Arg Phe Ser Asn Tyr Lys Gly Arg Gin Met lie 
15 10 15 

aaa aca acg cca cat aaa ata gtg ata ctg atg gga ata tta tta tea 96 
Lys Thr Thr Pro His Lys lie Val lie Leu Met Gly lie Leu Leu Ser 
20 25 30 

ccc tea gta ttt gca acg gat att aat gta gag ttt aca gec act gtc 144 
Pro Ser Val Phe Ala Thr Asp lie Asn Val Glu Phe Thr Ala Thr Val 
35 40 45 

aaa gcg aca acc tgt aac ate aca ctt act ggt aat aac gtc acg aat 192 
Lys Ala Thr Thr Cys Asn lie Thr Leu Thr Gly Asn Asn Val Thr Asn 
50 55 60 

gat ggc aat aat aac tac aca ttg aga ate cct aag atg ggt ctg gat 240 
Asp Gly Asn Asn Asn Tyr Thr Leu Arg He Pro Lys Met Gly Leu Asp 
65 70 75 80 

aag ate gcg aat aaa acg aca gaa tct cag get gat ttt aaa ctg gtt 288 
Lys He Ala Asn Lys Thr Thr Glu Ser Gin Ala Asp Phe Lys Leu Val 
85 90 95 

gec agt ggg tgc age agt ggc ate agt tgg att gat acc act ctg acc 336 
Ala Ser Gly Cys Ser Ser Gly He Ser Trp He Asp Thr Thr Leu Thr 
100 105 110 

gga aat gca tea tea age tea cct aag ctt att ata ccg cag tct ggt 384 
Gly Asn Ala Ser Ser Ser Ser Pro Lys Leu He He Pro Gin Ser Gly 
115 120 125 

gat tea tct teg acg aca agt aat ate ggt atg ggt ttc aaa aaa egg 432 
Asp Ser Ser Ser Thr Thr Ser Asn He Gly Met Gly Phe Lys Lys Arg 
130 135 140 

act act gat gat gee act ttc ctt aaa cct aac agt gcg gaa aag ata 480 
Thr Thr Asp Asp Ala Thr Phe Leu Lys Pro Asn Ser Ala Glu Lys He 
145 150 155 160 

cgc tgg age aca gac gag atg cag ccc gat aag ggt ctt gaa atg acc 528 
Arg Trp Ser Thr Asp Glu Met Gin Pro Asp Lys Gly Leu Glu Met Thr 
165 170 175 

gtt gcg eta cgt gaa aca gat gca ggg caa ggc gta ccg ggg aat ttt 576 
Val Ala Leu Arg Glu Thr Asp Ala Gly Gin Gly Val Pro Gly Asn Phe 
180 185 190 

cgt gca ctg gcg acg ttt aat ttc ate tat caa taa 612 
Arg Ala Leu Ala Thr Phe Asn Phe He Tyr Gin * 
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195 200 



<210> 142 
<211> 420 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (420) 

<400> 142 

atg gca atg act tac cac ctg gac gtc gtc age gca gag caa caa atg 48 
Met Ala Met Thr Tyr His Leu Asp Val Val Ser Ala Glu Gin Gin Met 
15 10 15 

ttc tct ggt ctg gtc gag aaa ate cag gta acg ggt age gaa ggt gaa 96 
Phe Ser Gly Leu Val Glu Lys lie Gin Val Thr Gly Ser Glu Gly Glu 
20 25 30 

ctg ggg ate tac cct ggc cac gca ccg ctg etc ace gee att aag cct 144 
Leu Gly lie Tyr Pro Gly His Ala Pro Leu Leu Thr Ala lie Lys Pro 
35 40 45 

ggt atg att cgc ate gtg aaa cag cac ggt cac gaa gag ttt ate tat 192 
Gly Met. He Arg He Val Lys Gin His Gly His Glu Glu Phe He Tyr 
50 55 60 

ctg tct ggc ggc att ctt gaa gtg cag cct ggc aac gtg acc gtt ctg 240 
Leu Ser Gly Gly He Leu Glu Val Gin Pro Gly Asn Val Thr Val Leu 
65 70 75 80 

gee gac acc gca att cgc ggc cag gat etc gac gaa gcg cga gee atg 288 
Ala Asp Thr Ala He Arg Gly Gin Asp Leu Asp Glu Ala Arg Ala Met 
85 90 95 

gaa gcg aaa cgt aag get gaa gag cac att age age tct cac ggc gac 336 
Glu Ala Lys Arg Lys Ala Glu Glu His He Ser Ser Ser His Gly Asp 
100 105 110 

gta gat tac get cag gcg tct gcg gaa ctg gee aaa gcg ate gcg cag 384 
Val Asp Tyr Ala Gin Ala Ser Ala Glu Leu Ala Lys Ala He Ala Gin 
115 120 125 

ctg cgc gtt ate gag ttg acc aaa aaa gcg atg taa 420 
Leu Arg Val He Glu Leu Thr Lys Lys Ala Met * 
. 130 135 



<210> 143 
<211> 1383 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1383) 

<400> 143 

atg get act gga aag att gtc cag gta ate ggc gec gta gtt gac gtc 48 
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Met Ala Thr Gly Lys lie Val Gin Val lie Gly Ala Val Val Asp Val 
15 10 15 

gaa ttc cct cag gat gcc gta ccg cgc gtg tac gat get ctt gag gtg 96 
Glu Phe Pro Gin Asp Ala Val Pro Arg Val Tyr Asp Ala Leu Glu Val 
20 25 30 

caa aat ggt aat gag cgt ctg gtg ctg gaa gtt cag cag cag etc ggc 144 
Gin Asn Gly Asn Glu Arg Leu Val Leu Glu Val Gin Gin Gin Leu Gly 
35 40 45 

ggc ggt ate gta cgt acc ate gca atg ggt tec tec gac ggt ctg cgt 192 
Gly Gly lie Val Arg Thr lie Ala Met Gly Ser Ser Asp Gly Leu Arg 
50 55 60 

C 9 C ggt ct 9 9 at g ta aaa 9 ac c ^c gaa cac ccg att gaa gtc ccg gta 240 
Arg Gly Leu Asp Val Lys Asp Leu Glu His Pro lie Glu Val Pro Val 
65 70 75 80 

ggt aaa gcg act ctg ggc cgt ate atg aac gta ctg ggt gaa ccg gtc 288 
Gly Lys Ala Thr Leu Gly Arg lie Met Asn Val Leu Gly Glu Pro Val 
85 90 95 

gac atg aaa ggc gag ate ggt gaa gaa gag cgt tgg gcg att cac cgc 336 
Asp Met Lys Gly Glu lie Gly Glu Glu Glu Arg Trp Ala lie His Arg 
100 105 110 

gca gca cct tec tac gaa gag ctg tea aac tct cag gaa ctg ctg gaa 384 
Ala Ala Pro Ser Tyr Glu Glu Leu Ser Asn Ser Gin Glu Leu Leu Glu 
115 120 125 

acc ggt ate aaa gtt ate gac ctg atg tgt ccg ttc get aag ggc ggt 432 
Thr Gly lie Lys Val lie Asp Leu Met Cys Pro Phe Ala Lys Gly Gly 
130 135 140 

aaa gtt ggt ctg ttc ggt ggt gcg ggt gta ggt aaa acc gta aac atg 480 
Lys Val Gly Leu Phe Gly Gly Ala Gly Val Gly Lys Thr Val Asn Met 
145 150 155 160 

atg gag etc att cgt aac ate gcg ate gag cac tec ggt tac tct gtg 528 
Met Glu Leu lie Arg Asn lie Ala lie Glu His Ser Gly Tyr Ser Val 
165 170 175 

ttt gcg ggc gta ggt gaa cgt act cgt gag ggt aac gac ttc tac cac 576 
Phe Ala Gly Val Gly Glu Arg Thr Arg Glu Gly Asn Asp Phe Tyr His 
180 185 190 

gaa atg acc gac tec aac gtt ate gac aaa gta tec ctg gtg tat ggc 624 
Glu Met Thr Asp Ser Asn Val lie Asp Lys Val Ser Leu Val Tyr Gly 
195 200 " 205 

cag atg aac gag ccg ccg gga aac cgt ctg cgc gtt get ctg acc ggt 672 
Gin Met Asn Glu Pro Pro Gly Asn Arg Leu Arg Val Ala Leu Thr Gly 
210 215 220 

ctg acc atg get gag aaa ttc cgt gac gaa ggt cgt gac gtt ctg ctg 720 
Leu Thr Met Ala Glu Lys Phe Arg Asp Glu Gly Arg Asp Val Leu Leu 
225 230 235 240 

ttc gtt gac aac ate tat cgt tac acc ctg gcc ggt acg gaa gta tec 768 
Phe Val Asp Asn He Tyr Arg Tyr Thr Leu Ala Gly Thr Glu val Ser 
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245 250 255 

gca ctg ctg ggc cgt atg cct tea gcg gta ggt tat cag ccg acc ctg 816 
Ala Leu Leu Gly Arg Met Pro Ser Ala Val Gly Tyr Gin Pro Thr Leu 
260 265 270 

gcg gaa gag atg ggc gtt ctg cag gaa cgt ate acc tec acc aaa act 864 
Ala Glu Glu Met Gly Val Leu Gin Glu Arg lie Thr Ser Thr Lys Thr 
275 280 285 

ggt tct ate acc tec gta cag gca gta tac gta cct gcg gat gac ttg 912 
Gly Ser He Thr Ser Val Gin Ala Val Tyr Val Pro Ala Asp Asp Leu 
290 295 300 

act gac ccg tct ccg gca acc acc ttt gcg cac ctt gac gca acc gtg -960 
Thr Asp Pro Ser Pro Ala Thr Thr Phe Ala His Leu Asp Ala Thr Val 
305 310 315 320 

gta ctg age cgt cag ate gcg tct ctg ggt ate tac ccg gec gtt gac 1008 
Val Leu Ser Arg Gin He Ala Ser Leu Gly He Tyr Pro Ala Val Asp 
325 330 335 

ccg ctg gac tec acc age cgt cag ctg gac ccg ctg gtg gtt ggt cag 1056 
Pro Leu Asp Ser Thr Ser Arg Gin Leu Asp Pro Leu Val Val Gly Gin 
340 345 350 

gaa cac tac gac acc gcg cgt ggc gtt cag tec ate ctg caa cgt tat 1104 
Glu His Tyr Asp Thr Ala Arg Gly Val Gin Ser He Leu Gin Arg Tyr 
355 360 365 

cag gaa ctg aaa gac ate ate gee ate ctg ggt atg gat gaa ctg tct 1152 
Gin Glu Leu Lys Asp He He Ala He Leu Gly Met Asp Glu Leu Ser 
370 375 380 

gaa gaa gac aaa ctg gtg gta gcg cgt get cgt aag ate cag cgc ttc 1200 
Glu Glu Asp Lys Leu Val Val Ala Arg Ala Arg Lys He Gin Arg Phe 
385 390 395 400 

ctg tec cag ccg ttc ttc gtg gca gaa gta ttc acc ggt tct ccg ggt 1248 
Leu Ser Gin Pro Phe Phe Val Ala Glu Val Phe Thr Gly Ser Pro Gly 
405 410 415 

aaa tac gtc tec ctg aaa gac acc ate cgt ggc ttt aaa ggc ate atg 1296 
Lys Tyr Val Ser Leu Lys Asp Thr He Arg Gly Phe Lys Gly He Met 
420 425 430 

gaa ggc gaa tac gat cac ctg ccg gag cag gcg ttc tac atg gtc ggt 1344 
Glu Gly Glu Tyr Asp His Leu Pro Glu Gin Ala Phe Tyr Met Val Gly 
435 440 445 

tec ate gaa gaa get gtg gaa aaa gee aaa aaa ctt taa 1383 
Ser He Glu Glu Ala Val Glu Lys Ala Lys Lys Leu * 
450 455 460 



<210> 144 
<211> 864 
<212> DNA 

<213> Escherichia coli 
<220> 
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<221> CDS 

<222> (1) . . . (864) 

<400> 144 

atg gcc ggc gca aaa gag ata cgt agt aag ate gca age gtc cag aac 48 
Met Ala Gly Ala Lys Glu lie Arg Ser Lys \ He Ala Ser Val Gin Asn 
15 10 15 

acg caa aag ate act aaa gcg atg gag atg gtc gcc get tec aaa atg 96 
Thr Gin Lys lie Thr Lys Ala Met Glu Met Val Ala Ala Ser Lys Met 
20 25 30 

cgt aaa teg cag gat cgc atg gcg gcc age cgt cct tat gca gaa ace 144 
Arg Lys Ser Gin Asp Arg Met Ala Ala Ser Arg Pro Tyr Ala Glu Thr 
35 40 45 

atg cgc aaa gtg att ggt cac ctt gca cac ggt aat ctg gaa tat aag 192 
Met Arg Lys Val He Gly His Leu Ala His Gly Asn Leu Glu Tyr Lys 
50 55 60 

cac cct tac ctg gaa gac cgc gac gtt aaa cgc gtg ggc tac ctg gtg 240 
His Pro Tyr Leu Glu Asp Arg Asp Val Lys Arg Val Gly Tyr Leu Val 
65 70 75 80 

gtg teg acc gac cgt ggt ttg tgc ggt ggt ttg aac att aac ctg ttc 288 
Val Ser Thr Asp Arg Gly Leu Cys Gly Gly Leu Asn He Asn Leu Phe 
85 90 95 

aaa aaa ctg ctg gcg gaa atg aag acc tgg acc gac aaa ggc gtt caa 336 
Lys Lys Leu Leu Ala Glu Met Lys Thr Trp Thr Asp Lys Gly Val Gin 
100 105 110 

tgc gac etc gca atg ate ggc teg aaa ggc gtg teg ttc ttc aac tec 384 
Cys Asp Leu Ala Met He Gly Ser Lys Gly Val Ser Phe Phe Asn Ser 
115 120 125 

gtg ggc ggc aat gtt gtt gcc cag gtc acc ggc atg ggg gat aac cct 432 
Val Gly Gly Asn Val Val Ala Gin Val Thr Gly Met Gly Asp Asn Pro 
130 135 140 

tec ctg tec gaa ctg ate ggt ccg gta aaa gtg atg ttg cag gcc tac 480 
Ser Leu Ser Glu Leu He Gly Pro Val Lys Val Met Leu Gin Ala Tyr 
145 150 155 160 

gac gaa ggc cgt ctg gac aag ctt tac att gtc age aac aaa ttt att 528 
Asp Glu Gly Arg Leu Asp Lys Leu Tyr He Val Ser Asn Lys Phe He 
165 170 175 

aac acc atg tct cag gtt ccg acc ate age cag ctg ctg ccg tta ccg 576 
Asn Thr Met Ser Gin Val Pro Thr He Ser Gin Leu Leu Pro Leu Pro 
180 185 190 

gca tea gat gat gat gat ctg aaa cat aaa tec tgg gat tac ctg tac 624 
Ala Ser Asp Asp Asp Asp Leu Lys His Lys Ser Trp Asp Tyr Leu Tyr 
195 200 205 

gaa ccc gat ccg aag gcg ttg ctg gat acc ctg ctg cgt cgt tat gtc 672 
Glu Pro Asp Pro Lys Ala Leu Leu Asp Thr Leu Leu Arg Arg Tyr Val 
210 215 220 

gaa tct cag gtt tat cag ggc gtg gtt gaa aac ctg gcc age gag cag 720 
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Glu Ser Gin Val Tyr Gin Gly Val Val Glu Asn Leu Ala Ser Glu Gin 
225 230 235 240 

gcc gcc cgt atg gtg gcg atg aaa gcc gcg acc gac aat ggc ggc age 768 
Ala Ala Arg Met Val Ala Met Lys Ala Ala Thr Asp Asn Gly Gly Ser 
245 250 255 

ctg att aaa gag ctg cag ttg gta tac aac aaa get cgt cag gcc age 816 
Leu lie Lys Glu Leu Gin Leu Val Tyr Asn Lys Ala Arg Gin Ala Ser 
260 265 , 270 

att act cag gaa etc acc gag ate gtc teg ggg gcc gcc gcg gtt taa 864 
lie Thr Gin Glu Leu Thr Glu lie Val Ser Gly Ala Ala Ala Val * 
275 280 285 



<210> 145 
<211> 1542 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1542) 

<400> 145 

atg caa ctg aat tec acc gaa ate age gaa ctg ate aag cag cgc att 48 

Met Gin Leu Asn Ser Thr Glu lie Ser Glu Leu lie Lys Gin Arg He 

15 10 15 

get cag ttc aat gtt gtg agt gaa get cac aac gaa ggt act att gtt 96 
Ala Gin Phe Asn Val Val Ser Glu Ala His Asn Glu Gly Thr He Vai 
20 25 30 

tct gta agt gac ggt gtt ate cgc att cac ggc ctg gcc gat tgt atg 144 
Ser Val Ser Asp Gly Val He Arg He His Gly Leu Ala Asp Cys Met 
35 40 45 

cag ggt gaa atg ate tec ctg ccg ggt aac cgt tac get ate gca ctg 192 
Gin Gly Glu Met He Ser Leu Pro Gly Asn Arg Tyr Ala He Ala Leu 
50 55 60 

aac etc gag cgc gac tct gta ggt gcg gtt gtt atg ggt ccg tac get 240 
Asn Leu Glu Arg Asp Ser Val Gly Ala Val Val Met Gly Pro Tyr Ala 
65 70 75 80 

gac ctt gcc gaa ggc atg aaa gtt aag tgt act ggc cgt ate ctg gaa 288 
Asp Leu Ala Glu Gly Met Lys Val Lys Cys Thr Gly Arg He Leu Glu 
85 90 95 

gtt ccg gtt ggc cgt ggc ctg ctg ggc cgt gtg gtt aac act ctg ggt 336 
Val Pro Val Gly Arg Gly Leu Leu Gly Arg Val Val Asn Thr Leu Gly 
100 105 110 

gca cca ate gac ggt aaa ggt ccg ctg gat cac gac ggc ttc tct get 384 
Ala Pro He Asp Gly Lys Gly Pro Leu Asp His Asp Gly Phe Ser Ala 
115 120 ' 125 

gta gaa gca ate get ccg ggc gtt ate gaa cgt cag tec gta gat cag 432 
Val Glu Ala He Ala Pro Gly Val He Glu Arg Gin Ser Val Asp Gin 
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130 135 140 

ccg gta cag acc ggt tat aaa gcc gtt gac tec atg ate cca ate ggt 480 
Pro Val Gin Thr Gly Tyr Lys Ala Val Asp Ser Met lie Pro He Gly 
145 150 155 160 

cgt ggt cag cgt gaa ttg ate ate ggt gac cgt cag aca ggt aaa acc 528 
Arg Gly Gin Arg Glu Leu lie He Gly Asp Arg Gin Thr Gly Lys Thr 
165 170 175 

gca ctg get ate gat gcc ate ate aac cag cgc gat tec ggt ate aaa 576 
Ala Leu Ala He Asp Ala He He Asn Gin Arg Asp Ser Gly He Lys 
180 185 190 

tgt ate tat gtc get ate ggc cag aaa gcg tec acc att tct aac gtg 624 
Cys He Tyr Val Ala He Gly Gin Lys Ala Ser Thr He Ser Asn Val 
195 200 205 

gta cgt aaa ctg gaa gag cac ggc gca ctg get aac acc ate gtt gtg 672 
Val Arg Lys Leu Glu Glu His Gly Ala Leu Ala Asn Thr He Val Val 
210 215 220 

gta gca acc gcg tct gaa tec get gca ctg caa tac ctg gca ccg tat 720 
Val Ala Thr Ala Ser Glu Ser Ala Ala Leu Gin Tyr Leu Ala Pro Tyr 
225 230 235 240 

gcc ggt tgc gca atg ggc gaa tac ttc cgt gac cgc ggt gaa gat gcg 768 
Ala Gly Cys Ala Met Gly Glu Tyr Phe Arg Asp Arg Gly Glu Asp Ala 
245 250 255 

ctg ate att tac gat gac ctg tct aaa cag get gtt get tac cgt cag 816 
Leu He He Tyr Asp Asp Leu Ser Lys Gin Ala Val Ala Tyr Arg Gin 
260 265 270 

ate tec ctg ctg etc cgt cgt ccg cca gga cgt gaa gca ttc ccg ggc 864 
He Ser Leu Leu Leu Arg Arg Pro Pro Gly Arg Glu Ala Phe Pro Gly 
275 280 285 

gac gtt ttc tac etc cac tct cgt ctg ctg gag cgt get gca cgt gtt 912 
Asp Val Phe Tyr Leu His Ser Arg Leu Leu Glu Arg Ala Ala Arg Val 
290 295 300 

aac gcc gaa tac gtt gaa gcc ttc acc aaa ggt gaa gtg aaa ggg aaa 960 
Asn Ala Glu Tyr Val Glu Ala Phe Thr Lys Gly Glu Val Lys Gly Lys 
305 310 315 320 

acc ggt tct ctg acc gca ctg ccg att ate gaa act cag gcg ggt gac 1008 
Thr Gly Ser Leu Thr Ala Leu Pro He He Glu Thr Gin Ala Gly Asp 
325 330 335 

gtt tct gcg ttc gtt ccg acc aac. gta ate tec att acc gat ggt cag 1056 
Val Ser Ala Phe Val Pro Thr Asn Val He Ser He Thr Asp Gly Gin 
340 345 350 

ate ttc ctg gaa acc aac ctg ttc aac gcc ggt att cgt cct gcg gtt 1104 
He Phe Leu Glu Thr Asn Leu Phe Asn Ala Gly He Arg Pro Ala Val 
355 360 365 

aac ccg ggt att tec gta tec cgt gtt ggt ggt gca gca cag acc aag 1152 
Asn Pro Gly He Ser Val Ser Arg Val Gly Gly Ala Ala Gin Thr Lys 
370 375 380 
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ate atg aaa aaa ctg tec ggt ggt ate cgt ace get ctg gca cag tat 1200 
lie Met Lys Lys Leu Ser Gly Gly lie Arg Thr Ala Leu Ala Gin Tyr 
385 390 395 400 

cgt gaa ctg gca gcg ttc tct cag ttt gca tec gac ctt gac gat gca 1248 
Arg Glu Leu Ala Ala Phe Ser Gin Phe Ala Ser Asp Leu Asp Asp Ala 
405 410 415 

aca cgt aag cag ctt gac cac ggt cag aaa gtg ace gaa ctg ctg aaa 1296 
Thr Arg Lys Gin Leu Asp His Gly Gin Lys Val Thr Glu Leu Leu Lys 
420 425 . 430 

cag aaa cag tat gcg ccg atg tec gtt gcg cag cag tct ctg gtt ctg 1344 
Gin Lys Gin Tyr Ala Pro Met Ser Val Ala Gin Gin Ser Leu Val Leu 
435 440 445 

ttc gca gca gaa cgt ggt tac ctg gcg gat gtt gaa ctg teg aaa att 1392 
Phe Ala Ala Glu Arg Gly Tyr Leu Ala Asp Val Glu Leu Ser Lys He 
450 455 460 

ggc age ttc gaa gee get ctg ctg get tac gtc gac cgt gat cac get 1440 
Gly Ser Phe Glu Ala Ala Leu Leu Ala Tyr Val Asp Arg Asp His Ala 
465 470 475 480 

ccg ttg atg caa gag ate aac cag acc ggt ggc tac aac gac gaa ate 1488 
Pro Leu Met Gin Glu He Asn Gin Thr Gly Gly Tyr Asn Asp Glu He 
485 490 495 

gaa ggc aag ctg aaa ggc ate etc gat tec ttc aaa gca acc caa tec 1536 
Glu Gly Lys Leu Lys Gly He Leu Asp Ser Phe Lys Ala Thr Gin Ser 
500 505 510 

tgg taa 1542 
Trp * 



<210> 146 
<211> 534 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(534) 

<400> 146 

atg tct gaa ttt att acg gta get cgc ccc tac gee aaa gca get ttt 48 
Met Ser Glu Phe He Thr Val Ala Arg Pro Tyr Ala Lys Ala Ala Phe 
15 10 15 

gac ttt gee gtc gaa cac caa agt gta gaa cgc tgg cag gac atg ctg 96 
Asp Phe Ala Val Glu His Gin Ser Val Glu Arg Trp Gin Asp Met Leu 
20 25 30 

gcg ttt gee gee gag gta acc aaa aac gaa caa atg gca gag ctt etc 144 
Ala Phe Ala Ala Glu Val Thr Lys Asn Glu Gin Met Ala Glu Leu Leu 
35 40 45 

tct ggc gcg ctt gcg cca gaa acg etc gee gag teg ttt ate gca gtt 192 
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Ser Gly Ala Leu Ala Pro Glu Thr Leu Ala Glu Ser Phe lie Ala Val 
50 55 60 

tgt ggt gag caa ctg gac gaa aac ggt cag aac ctg att egg gtt atg 240 
Cys Gly Glu Gin Leu Asp Glu Asn Gly Gin Asn Leu He Arg Val Met 
65 70 75 80 

get gaa aat ggt cgt ctt aac gcg etc ccg gat gtt ctg gag cag ttt 288 
Ala Glu Asn Gly Arg Leu Asn Ala Leu Pro Asp Val Leu Glu Gin Phe 
85 90 95 

att cac ctg cgt gec gtg agt gag get acc get gag gta gac gtc att 336 
He His Leu Arg Ala Val Ser Glu Ala Thr Ala Glu Val Asp Val He 
100 105 110 

tec get gee gca ctg agt gaa caa cag etc gcg aaa att tct get gcg 384 
Ser Ala Ala Ala Leu Ser Glu Gin Gin Leu Ala Lys He Ser Ala Ala 
115 120 125 

atg gaa aaa cgt ctg tea cgc aaa gtt aag ctg aat tgc aaa ate gat 432 
Met Glu Lys Arg Leu Ser Arg Lys Val Lys Leu Asn Cys Lys He Asp 
130 135 140 

aag tct gta atg gca ggc gtt ate ate cga gcg ggt gat atg gtc att 480 
Lys Ser Val Met Ala Gly Val He He Arg Ala Gly Asp Met Val He 
145 150 155 160 

gat ggc age gta cgc ggt cgt ctt gag cgc ctt gca gac gtc ttg cag 528 
Asp Gly Ser Val Arg Gly Arg Leu Glu Arg Leu Ala Asp Val Leu Gin 
165 170 175 

tct taa 534 
Ser * 



<210> 147 
<211> 471 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (471) 

<400> 147 

gtg aat ctt aac gca aca ate etc ggc cag gee ate gcg ttt gtc ctg 48 
Met Asn Leu Asn Ala Thr He Leu Gly Gin Ala He Ala Phe Val Leu 
1 5 10 15 

ttc gtt ctg ttc tgc atg aag tac gta tgg ccg cca tta atg gca gee 96 
Phe Val Leu Phe Cys Met Lys Tyr Val Trp Pro Pro Leu Met Ala Ala 
20 25 30 

ate gaa aaa cgt caa aaa gaa att get gac ggc ctt get tec gca gaa 144 
He Glu Lys Arg Gin Lys Glu He Ala Asp Gly Leu Ala Ser Ala Glu 
35 40 45 

cga gca cat aag gac ctt gac ctt gca aag gee age gcg acc gac cag 192 
Arg Ala His Lys Asp Leu Asp Leu Ala Lys Ala Ser Ala Thr Asp Gin 
50 55 60 
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ctg aaa aaa gcg aaa gcg gaa gcc cag gta ate ate gag cag gcg aac 240 
Leu Lys Lys Ala Lys Ala Glu Ala Gin Val lie lie Glu Gin Ala Asn 
65 70 75 80 

aaa cgc cgc teg cag att ctg gac gaa gcg aaa get gag gca gaa cag 288 
Lys Arg Arg Ser Gin He Leu Asp Glu Ala Lys Ala Glu Ala Glu Gin 
85 90 95 

gaa cgt act aaa ate gtg gcc cag gcg cag gcg gaa att gaa gcc gag 336 
Glu Arg Thr Lys He Val Ala Gin Ala Gin Ala Glu He Glu Ala Glu 
100 105 110 

cgt aaa cgt gcc cgt gaa gag ctg cgt aag caa gtt get ate ctg get 384 
Arg Lys Arg Ala Arg Glu Glu Leu Arg Lys Gin Val Ala He Leu Ala 
115 120 125 

gtt get ggc gcc gag aag ate ate gaa cgt tec gtg gat gaa get get 432 
Val Ala Gly Ala Glu Lys He He Glu Arg Ser Val Asp Glu Ala Ala 
130 135 140 

aac age gac ate gtg gat aaa ctt gtc get gaa ctg taa 471 
Asn Ser Asp He Val Asp Lys Leu Val Ala Glu Leu * 
145 150 155 



<210> 148 
<211> 240 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (240) 

<400> 148 

atg gaa aac ctg aat atg gat ctg ctg tac atg get gcc get gtg atg 48 
Met Glu Asn Leu Asn Met Asp Leu Leu Tyr Met Ala Ala Ala Val Met 
15 10 15 

atg ggt ctg gcg gca ate ggt get gcg ate ggt ate ggc ate etc ggg 96 
Met Gly Leu Ala Ala He Gly Ala Ala He Gly He Gly He Leu Gly 
20 25 30 

ggt aaa ttc ctg gaa ggc gca gcg cgt caa cct gat ctg att cct ctg 144 
Gly Lys Phe Leu Glu Gly Ala Ala Arg Gin Pro Asp Leu He Pro Leu 
35 40 45 

ctg cgt act cag ttc ttt ate gtt atg ggt ctg gtg gat get ate ccg 192 
Leu Arg Thr Gin Phe Phe He Val Met Gly Leu Val Asp Ala He Pro 
50 55 60 



atg ate get gta ggt ctg ggt ctg tac gtg atg ttc get gtc gcg tag 240 
Met He Ala Val Gly Leu Gly Leu Tyr Val Met Phe Ala Val Ala * 
65 70 75 



<210> 149 
<211> 816 
<212> DNA 
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<2X3> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (816) 

<400> 149 

atg get tea gaa aat atg acg ccg cag gat tac ata gga cac cac ctg • 48 
Met Ala Ser Glu Asn Met Thr Pro Gin Asp Tyr lie Gly His His Leu 
15 10 15 

aat aac ctt cag ctg gac ctg cgt aca ttc teg ctg gtg gat cca caa 96 
Asn Asn Leu Gin Leu Asp Leu Arg Thr Phe Ser Leu Val Asp Pro Gin 
20 25 30 

aac ccc cca gee acc ttc tgg aca ate aat att gac tec atg ttc ttc 144 
Asn Pro Pro Ala Thr Phe Trp Thr lie Asn lie Asp Ser Met Phe Phe 
35 40 45 

teg gtg gtg ctg ggt ctg ttg ttc ctg gtt tta ttc cgt age gta gec 192 
Ser Val Val Leu Gly Leu Leu Phe Leu Val Leu Phe Arg Ser Val Ala 
50 ~ 55 60 

aaa aag gcg acc age ggt gtg cca ggt aag ttt cag acc gcg att gag 240 
Lys Lys Ala Thr Ser Gly Val Pro Gly Lys Phe Gin Thr Ala lie Glu 
65 70 75 80 

ctg gtg ate ggc ttt gtt aat ggt age gtg aaa gac atg tac cat ggc 288 
Leu Val lie Gly Phe Val Asn Gly Ser Val Lys Asp Met Tyr His Gly 
85 90 95 

aaa age aag ctg att get ccg ctg gee ctg acg ate ttc gtc tgg gta 336 
Lys Ser Lys Leu lie Ala Pro Leu Ala Leu Thr lie Phe Val Trp Val 
100 105 110 

ttc ctg atg aac ctg atg gat tta ctg cct ate gac ctg ctg ccg tac 384 
Phe Leu Met Asn Leu Met Asp Leu Leu Pro lie Asp Leu Leu Pro Tyr 
115 120 125 

att get gaa cat gta ctg ggt ctg cct gca ctg cgt gtg gtt ccg tct 432 
lie Ala Glu His Val Leu Gly Leu Pro Ala Leu Arg Val Val Pro Ser 
. 130 135 140 

gcg gac gtg aac gta acg ctg tct atg gca ctg ggc gta ttt ate ctg 480 
Ala Asp Val Asn Val Thr Leu Ser Met Ala Leu Gly Val Phe lie Leu 
145 150 155 160 

att ctg ttc tac age ate aaa atg aaa ggc ate ggc ggc ttc acg aaa 528 
lie Leu Phe Tyr Ser lie Lys Met Lys Gly lie Gly Gly Phe Thr Lys 
165 170 175 

gag ttg acg ctg cag ccg ttc aat cac tgg gcg ttc att cct gtc aac 576 
Glu Leu Thr Leu Gin Pro Phe Asn His Trp Ala Phe lie Pro Val Asn 
180 185 190 

tta ate ctt gaa ggg gta age ctg ctg tec aaa cca gtt tea etc ggt 624 
Leu lie Leu Glu Gly Val Ser Leu Leu Ser Lys Pro Val Ser Leu Gly 
195 200 205 

ttg cga ctg ttc ggt aac atg tat gee ggt gag ctg att ttc att ctg 672 
Leu Arg Leu Phe Gly Asn Met Tyr Ala Gly Glu Leu lie Phe lie Leu 
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210 215 220 

i 

att get ggt ctg ttg ccg tgg tgg tea cag tgg ate ctg aat gtg ccg 720 
lie Ala Gly Leu Leu Pro Trp Trp Ser Gin Trp lie Leu Asn Val Pro 
225 230 235 240 

tgg 9 CC att ttc cac atc ct 9 atc att ac 9 ctg caa gee ttc ate ttc 768 
Trp Ala lie Phe His He Leu He He Thr Leu Gin Ala Phe lie Phe 
245 250 255 

a tg gtt ctg acg atc gtc tat ctg teg atg gcg tct gaa gaa cat taa 816 
Met Val Leu Thr lie Val Tyr Leu Ser Met Ala Ser Glu Glu His * 
260 265 270 



<210> 150 
<211> 393 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (393) 

<400> 150 

gtg aaa aac gtg atg tct gtg teg etc gtg agt cga aac gtt get egg 48 

Met Lys Asn Val Met Ser Val Ser Leu Val Ser Arg Asn Val Ala Arg 
15 10 15 

aag ctt ctg etc gtt cag tta ctg gtg gtg at a gca agt gga ttg ctg 96 
Lys Leu Leu Leu Val Gin Leu Leu Val Val He Ala Ser Gly Leu Leu 
20 25 30 

ttc age etc aaa gac ccc ttc tgg ggc gtc tct gca ata age ggg ggc 144 
Phe Ser Leu Lys Asp Pro Phe Trp Gly Val Ser Ala He Ser Gly Gly 
35 40 45 

ctg gca gtc ttt ctg cct aac gtt ttg ttt atg ata ttt gee tgg cgt 192 
Leu Ala Val Phe Leu Pro Asn Val Leu Phe Met He Phe Ala Trp Arg 
50 55 60 

cac cag gcg cat aca cca gcg aaa ggc egg gtg gee tgg aca ttc gca 240 
His Gin Ala His Thr Pro Ala Lys Gly Arg Val Ala Trp Thr Phe Ala 
65 70 75 80 

ttt ggc gaa get ttc aaa gtt ctg gcg atg ttg gtg tta ctg gtg gtg 288 
Phe Gly Glu Ala Phe Lys Val Leu Ala Met Leu Val Leu Leu Val Val 
85 90 95 

gcg ttg gcg gtt tta aag gcg gta ttc ttg ccg ctg atc gtt acg tgg 336 
Ala Leu Ala Val Leu Lys Ala Val Phe Leu Pro Leu He Val Thr Trp 
100 105 110 

gtt ttg gtg ctg gtg gtt cag ata ctg gca ccg get gta att aac aac 384 
Val Leu Val Leu Val Val Gin He Leu Ala Pro Ala Val He Asn Asn 
115 120 125 

aaa ggg taa 393 
Lys Gly * 
130 
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<210> 151 
<211> 1242 
<212> DNA 

<213> Escherichia coli 



<220> 
<221> CDS 

<222> (1) . . . (1242) 



<400> 151 

atg caa aaa cac ggc gac cgc tat gtg tgg ate aac ccg cct get att 48 
Met Gin Lys His Gly Asp Arg Tyr Val Trp lie Asn Pro Pro Ala lie ■ 
1 5 10 15 



ccg ctt tct acc gaa gag atg gac age gtt ttt gcg ctg cca tac aag 96 
Pro Leu Ser Thr Glu Glu Met Asp Ser Val Phe Ala Leu Pro Tyr Lys 
20 25 30 

cgc gtg cca cat ccg gee tat ggc aat gee cgt att ccg get tac gaa 144 
Arg Val Pro His Pro Ala Tyr Gly Asn Ala Arg lie Pro Ala Tyr Glu t 
35 40 45 

atg ate cgt ttt teg gtc aac att atg cgt ggc tgc ttt ggc ggc tgc 192 
Met lie Arg Phe Ser Val Asn He Met Arg Gly Cys Phe Gly Gly Cys 
50 55 60 



tct ttc tgt tct ate acc gag cac gaa ggg cgc att att cag age cgt 240 
Ser Phe Cys Ser lie Thr Glu His Glu Gly Arg lie lie Gin Ser Arg 
65 70 75 80 



tec gaa gat teg ate att aat gag ate gaa gcg ate cgc gac acc gtt 288 
Ser Glu Asp Ser lie lie Asn Glu He Glu Ala He Arg Asp Thr Val 
85 90 95 

cca ggt ttt acg ggc gtg att tec gat ctt ggt ggg cca act gee aac 336 
Pro Gly Phe Thr Gly Val He Ser Asp Leu Gly Gly Pro Thr Ala Asn 
100 105 110 

atg tat atg ttg cgc tgc aaa teg cca cgc get gaa caa act tgt cgc 384 
Met Tyr Met Leu Arg Cys Lys Ser Pro Arg Ala Glu Gin Thr Cys Arg 
115 120 125 



cgt ttg teg tgc gtt tat ccg gat att tgt cc<j cac atg gac acg aac 432 
Arg Leu Ser Cys Val Tyr Pro Asp He Cys Pro His Met Asp Thr Asn 
130 135 140 

cac gaa cca acg ate aac etc tat cgc cgt gcg cgt gat ctg aaa ggc 480 
His Glu Pro Thr He Asn Leu Tyr Arg Arg Ala Arg Asp Leu Lys Gly 
145 150 155 160 

att aaa aag ate ctg att gec tct ggt gtg cgt tat gac ata gee gta 528 
He Lys Lys He Leu He Ala Ser Gly Val Arg Tyr Asp He Ala Val 
165 170 175 

gaa gat ccg cgc tat ate aaa gaa ctg gcg acc cat cac gtc ggc ggt 576 
Glu Asp Pro Arg Tyr He Lys Glu Leu Ala Thr His His Val Gly Gly 
180 185 190 



tat ctg aag att gee ccg gaa cat acc gaa gaa ggg ccg tta teg aag 624 
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Tyr Leu Lys lie Ala Pro Glu His Thr Glu Glu Gly Pro Leu Ser Lys 
195 200 205 

atg atg aag ccg ggc atg ggc age tat gac cgc ttt aaa gag ctg ttc 672 
Met Met Lys Pro Gly Met Gly Ser Tyr Asp Arg Phe Lys Glu Leu Phe 
210 215 220 

gat act tac tea aaa cag gca ggt aaa gag cag tat ctg ate ccc tat 720 
Asp Thr Tyr Ser Lys Gin Ala Gly Lys Glu Gin Tyr Leu lie Pro Tyr 
225 230 235 240 

ttc ate tec gcg cac ccc ggt acg cgt gat gaa gat atg gtg aat ctg 768 
Phe lie Ser Ala His Pro Gly Thr Arg Asp Glu Asp Met Val Asn Leu 
245 250 255 

gcg ctg tgg ctg aaa aag cac cgc ttc cgc etc gac cag gtg cag aac 816 
Ala Leu Trp Leu Lys Lys His Arg Phe Arg Leu Asp Gin Val Gin Asn 
260 265 270 

ttc tat ccg teg ccg ctg gcg aac tea acc acc atg tat tac ace ggg 864 
Phe Tyr Pro Ser Pro Leu Ala Asn Ser Thr Thr Met Tyr Tyr Thr Gly 
275 280 285 

aaa aac ccg ctg gcg aag att ggt tat aag age gaa gac gtc ttc gta 912 
Lys Asn Pro Leu Ala Lys lie Gly Tyr Lys Ser Glu Asp Val Phe Val 
290 295 300 

ccg aag ggc gac aaa cag cgt cgt ttg cat aaa gcg ttg ttg cgt tac 960 
Pro Lys Gly Asp Lys Gin Arg Arg Leu His Lys Ala Leu Leu Arg Tyr 
305 310 315 320 

cac gat ccg gca aac tgg ccg tta ate cgc cag gcg ctg gaa gcg atg 1008 
His Asp Pro Ala Asn Trp Pro Leu lie Arg Gin Ala Leu Glu Ala Met 
325 330 335 

ggc aaa aag cat ctg att ggc age cgt cgc gat tgc tta gtg cct gcg 1056 
Gly Lys Lys His Leu lie Gly Ser Arg Arg Asp Cys Leu Val Pro Ala 
340 345 350 

cca acc att gaa gag atg cgt gaa get cgt cgc cag aac cgc aat acc 1104 
Pro Thr lie Glu Glu Met Arg Glu Ala Arg Arg Gin Asn Arg Asn Thr 
355 360 365 

cgt ccg gcg ttg acg aaa cat acg ccg atg gcg acc cag cgt cag acg 1152 
Arg Pro Ala Leu Thr Lys His Thr Pro Met Ala Thr Gin Arg Gin Thr 
370 375 380 

cct get acg gca aaa aaa gcg teg tct acg caa tct cgt ccg gtg aat 1200 
Pro Ala Thr Ala Lys Lys Ala Ser Ser Thr Gin Ser Arg Pro Val Asn 
385 390 395 400 

get ggt gcg aag aaa egg cct aaa gcg gcg gtt gga cgt taa 1242 
Ala Gly Ala Lys Lys Arg Pro Lys Ala Ala Val Gly Arg * 
405 410 



<210> 152 
<211> 978 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> CDS 

<222> (1)...(978) 

<400> 152 

atg age tct ate tec ctg ate caa ccg gat cgc gac ctg ttc tec tgg 48 
Met Ser Ser lie Ser Leu He Gin Pro Asp Arg Asp Leu Phe Ser Trp 
15 10 15 

ccg cag tac tgg gee gee tgt ttt gga ccg gca ccg ttt ttg ccg atg 96 
Pro Gin Tyr Trp Ala Ala Cys Phe Gly Pro Ala Pro Phe Leu Pro Met 
20 25 30 

tct cgt gaa gag atg gat caa ctt ggc tgg gat age tgc gac ate att 144 
Ser Arg Glu Glu Met Asp Gin Leu Gly Trp Asp Ser Cys Asp He He 
35 40 45 

ttg gtt act ggc gac gcg tat gtc gat cac cca age ttc ggg atg gcg 192 
Leu Val Thr Gly Asp Ala Tyr Val Asp His Pro Ser Phe Gly Met Ala 
50 55 60 

att tgc ggt cgt atg ctg gaa gcg cag ggc ttt cgc gtc ggg ate ate 240 
He Cys Gly Arg Met Leu Glu Ala Gin Gly Phe Arg Val Gly He He 
65 70 75 80 

gee cag ccg gac tgg age age aaa gac gac ttt atg cgt ctg ggt aaa 288 
Ala Gin Pro Asp Trp Ser Ser Lys Asp Asp Phe Met Arg Leu Gly Lys 
85 90 95 

ccg aat ctg ttt ttc ggt gtt act get ggc aac atg gat teg atg ate 336 
Pro Asn Leu Phe Phe Gly Val Thr Ala Gly Asn Met Asp Ser Met He 
100 105 110 

aac cgt tat ace gee gat cgc cgt tta cgt cat gac gat gee tac acg 384 
Asn Arg Tyr Thr Ala Asp Arg Arg Leu Arg His Asp Asp Ala Tyr Thr 
115 120 125 

ccg gat aac gtc gcg ggt aag cgc ccg gat cgc gee aca ctg gtt tat 432 
Pro Asp Asn Val Ala Gly Lys Arg Pro Asp Arg Ala Thr Leu Val Tyr 
130 135 140 

ace cag cgt tgt aaa gag gcg tgg aaa gat gta ccg gtg ate etc ggc 480 
Thr Gin Arg Cys Lys Glu Ala Trp Lys Asp Val Pro Val He Leu Gly 
145 150 155 160 

ggt att gag get agt ctg cgc cgt ace gcg cat tat gat tac tgg tec 528 
Gly He Glu Ala Ser Leu Arg Arg Thr Ala His Tyr Asp Tyr Trp Ser 
165 170 175 

gat ace gtg cgc cgt tec gtg ctg gtg gat teg aaa gee gac atg ctg 576 
Asp Thr Val Arg Arg Ser Val Leu Val Asp Ser Lys Ala Asp Met Leu 
180 " 185 190 

atg ttt ggt aac ggt gag cgt ccg ctg gtg gaa gtg gcg cat cgt ctg 624 
Met Phe Gly Asn Gly Glu Arg Pro Leu Val Glu Val Ala His Arg Leu 
195 200 205 

gcg atg ggc gag cca att agt gaa ate cgc gat gtg cgt aat ace gcg 672 
Ala Met Gly Glu Pro He Ser Glu He Arg Asp Val Arg Asn Thr Ala 
210 215 220 
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att ate gtg aaa gag gcg ctg cct ggc tgg age ggc gtg gat tec ace 720 
lie He Val Lys Glu Ala Leu Pro Gly Trp Ser Gly Val Asp Ser Thr 
225 230 235 240 

cgt ctt gat acc cct gga aaa ate gac cca ate ccg cat ccg tat ggt 768 
Arg Leu Asp Thr Pro Gly Lys He Asp Pro He Pro His Pro Tyr Gly 
245 250 255 

gaa gat ttg ccg tgc gcg gat aac aaa ccg gtg gca ccg aaa aag cag 8 IS 
Glu Asp Leu Pro Cys Ala Asp Asn Lys Pro Val Ala Pro Lys Lys Gin 
260 265 270 

gaa gee aaa gee gta acc gtg cag cca ccg cgc ccg aaa ccg tgg gaa 864 
Glu Ala Lys Ala Val Thr Val Gin Pro Pro Arg Pro Lys Pro Trp Glu 
275 280 285 

aaa acc tac gtg ttg ctg cct tct ttc gag aaa gtg aag ggc gat aaa 912 
Lys Thr Tyr Val Leu Leu Pro Ser Phe Glu Lys Val Lys Gly Asp Lys 
290 295 300 

gtg ctg tac gec cat get teg cgt att ctg cac cac gaa acc aac cca 960 
Val Leu Tyr Ala His Ala Ser Arg lie Leu His His Glu Thr Asn Pro 
305 310 315 320 

get gtg ccc gcg cat tga 978 
Ala Val Pro Ala His * 
325 



<210> 153 
<211> 681 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (681) 

<400> 153 

atg att cag tat ctg aac gtc ttt ttt tac gat ate tac ccg tac att 48 

Met He Gin Tyr Leu Asn Val Phe Phe Tyr Asp He Tyr Pro Tyr He 
1 5 10 15 

tgt gcg acg gtg ttt ttc etc ggc age tgg ctg cgc tac gac tac ggg 96 
Cys Ala Thr Val Phe Phe Leu Gly Ser Trp Leu Arg Tyr Asp Tyr Gly 
20 25 30 

cag tac acc tgg cgc gee tec tea agt caa atg etc gat aaa cgc ggg 144 
Gin Tyr Thr Trp Arg Ala Ser Ser Ser Gin Met Leu Asp Lys Arg Gly 
35 40 45 

atg gtg ata tgg teg aat ctg ttc cat ate ggc att ttg ggg att ttc 192 
Met Val He Trp Ser Asn Leu Phe His He Gly He Leu Gly He Phe 
50 55 60 

ttc ggg cac ttg ttc ggc atg tta acg ccg cac tgg atg tac gcg tgg 240 
Phe Gly His Leu Phe Gly Met Leu Thr Pro His Trp Met Tyr Ala Trp 
65 70 75 80 

ttt ttg cca gtg gca gcg aaa caa etc atg gcg atg gtg etc ggc ggt 288 



-92- 



WO 01/48209 



PCT/DS00/34419 



Phe Leu Pro Val Ala Ala Lys Gin Leu Met Ala Met Val Leu Gly Gly 
85 90 95 

att tgc ggt gtg ttg acg ttg att ggc ggc gca gga ctg ctg tgg cgc 336 
lie Cys Gly Val Leu Thr Leu He Gly Gly Ala Gly Leu Leu Trp Arg 
100 105 110 

agg ctg aca aat cag cgc gtg cgt gcc act tec act acg ccg gat ate 384 
Arg Leu Thr Asn Gin Arg Val Arg Ala Thr Ser Thr Thr Pro Asp He 
115 120 125 

ate ate atg age att ttg ctg ate cag tgt ctg ttg ggg eta agt acc 432 
He He Met Ser He Leu Leu He Gin Cys Leu Leu Gly Leu Ser Thr 
130 135 140 

at a ccg ttt tea gca cag tat cct gat ggt age gaa atg atg aag ctg 480 
He Pro Phe Ser Ala Gin Tyr Pro Asp Gly Ser Glu Met Met Lys Leu 
145 150 155 160 

gtg ggt tgg gcg caa age att gtc act ttc cgt ggt gga teg tea gag 528 
Val Gly Trp Ala Gin Ser He Val Thr Phe Arg Gly Gly Ser Ser Glu 
165 170 175 

atg etc aac ggc gta gcg ttc gtc ttc cgc ctg cat ctg gtg ttg gga 576 
Met Leu Asn Gly Val Ala Phe Val Phe Arg Leu His Leu Val Leu Gly 
180 185 190 

atg acc ate ttc ctg etc ttc ccg ttc acc cga ctg gtg cat gtg tgg 624 
Met Thr He Phe Leu Leu Phe Pro Phe Thr Arg Leu Val His Val Trp 
195 200 205 

age gcg ccg ttt gag tac ttt act cgt cga tat caa att gtg cgt teg 672 
Ser Ala Pro Phe Glu Tyr Phe Thr Arg Arg Tyr Gin He Val Arg Ser 
210 215 220 

egg cga taa 681 

Arg Arg * 

225 



<210> 154 
<211> 696 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (696) 

<400> 154 

atg cag ate etc aaa gtg ate ggc ctg ttg atg gag tat ccg gac gag 48 

Met Gin He Leu Lys Val He Gly Leu Leu Met Glu Tyr Pro Asp Glu 
1 5 10 15 

ctg ttg tgg gaa tgc aag gag gac gcg ctg gcg ttg ate cgc cgc gac 96 
Leu Leu Trp Glu Cys Lys Glu Asp Ala Leu Ala Leu He Arg Arg Asp 
20 25 30 

gcg ccg atg ctt acg gat ttc acc cac aac ctg ctt aac gcg ccg ctg 144 
Ala Pro Met Leu Thr Asp Phe Thr His Asn Leu Leu Asn Ala Pro Leu 
35 40 45 
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ctg gat aaa cag gcc gaa tgg tgc gaa gtg ttt gac cgc ggg cgc acc 192 
Leu Asp Lys Gin Ala Glu Trp Cys Glu Val Phe Asp Arg Gly Arg Thr 
50 55 .60 

acg teg ctg ctg ctg ttc gaa cat gtt cat gcc gag tec cgc gat cgc 240 
Thr Ser Leu Leu Leu Phe Glu His Val His Ala Glu Ser Arg Asp Arg 
65 70 75 80 

ggc cag gca atg gtg gac ctg ctg gcg gag tat gaa aag gtc ggc ctg 288 
Gly Gin Ala Met Val Asp Leu Leu Ala Glu Tyr Glu Lys Val Gly Leu 
85 90 95 

cag ctg gat tgt egg gaa ctg ccc gat tat eta ccg ctg tat ctg gag 336 
Gin Leu Asp Cys Arg Glu Leu Pro Asp Tyr Leu Pro Leu Tyr Leu Glu 
100 105 110 

tat tta age gta ctg cct gac gat cag gcg aaa gaa gga ttg etc aat 384 
Tyr Leu Ser Val Leu Pro Asp Asp Gin Ala Lys Glu Gly Leu Leu Asn 
115 120 125 

gtt gcg ccg ate etc gcc ctg ctt ggc ggt cgc tta aaa caa cgc gag 432 
Val Ala Pro He Leu Ala Leu Leu Gly Gly Arg Leu Lys Gin Arg Glu 
130 135 140 

gca ccg tgg tac gcg ttg ttt gat get ctg ctg caa ctg gcg gga age 480 
Ala Pro Trp Tyr Ala Leu Phe Asp Ala Leu Leu Gin Leu Ala Gly Ser 
145 150 155 160 

tec ctt tea agt gac agt gtc aca aaa caa gta aac age gaa gag cgc 528 
Ser Leu Ser Ser Asp Ser Val Thr Lys Gin Val Asn Ser Glu Glu Arg 
165 170 175 

gat gac acc cgc cag gcg ctt gat gca gtg tgg gaa gag gaa cag gtg 576 
Asp Asp Thr Arg Gin Ala Leu Asp Ala Val Trp Glu Glu Glu Gin Val 
180 185 190 

aag ttt att gaa gat aac gcc acg gcg tgt gac age teg ccg ctt aat 624 
Lys Phe He Glu Asp Asn Ala Thr Ala Cys Asp Ser Ser Pro Leu Asn 
195 200 205 

caa tat cag cga cgc ttt age cag gat gtc gcg ccg caa tat gtc gac 672 
Gin Tyr Gin Arg Arg Phe Ser Gin Asp Val Ala Pro Gin Tyr Val Asp 
210 215 220 

ate agt gcg gga ggt ggg aaa tga 696 
He Ser Ala Gly Gly Gly Lys * 
225 230 



<210> 155 
<211> 1545 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1545) 

<400> 155 

atg aaa ate cgt tea caa gtc ggc atg gtg ctt aac etc gac aaa tgt 48 
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Met Lys lie Arg Ser Gin Val Gly Met Val Leu Asn Leu Asp Lys Cys 
15 10 15 

ate ggc tgc cat acc tgt teg gtg acc tgt aaa aac gtc tgg acc ggg 96 
lie Gly Cys His Thr Cys Ser Val Thr Cys Lys Asn Val Trp Thr Gly 
20 . 25 30 

cgc gaa ggc atg gag tac gca tgg ttt aac aac gtc gaa acc aaa ccg 144 
Arg Glu Gly Met Glu Tyr Ala Trp Phe Asn Asn Val Glu Thr Lys Pro 
35 40 45 

ggc att ggt tat ccg aaa aac tgg gaa gat cag gaa gag tgg caa ggc 192 
Gly lie Gly Tyr Pro Lys Asn Trp Glu Asp Gin Glu Glu Trp Gin Gly 
50 55 60 

ggc tgg gtg cgt gat gtg aat ggc aag at a cgc ccg cgt ctg ggc aac 240 
Gly Trp Val Arg Asp Val Asn Gly Lys lie Arg Pro Arg Leu Gly Asn 
65 70 75 80 

aag atg ggc gta ata acc aaa ate ttc gec aac ccg gtg gtg ccg cag 288 
Lys Met Gly Val lie Thr Lys lie Phe Ala Asn Pro Val Val Pro Gin 
85 90 95 

att gat gat tac tac gaa cct ttc acc ttc gac tac gaa cat ttg cat 336 
lie Asp Asp Tyr Tyr Glu Pro Phe Thr Phe Asp Tyr Glu His Leu His 
100 105 110 

age gca ccg gaa ggc aaa cat att cct act get cgc ccg cgt tea ctg 384 
Ser Ala Pro Glu Gly Lys His He Pro Thr Ala Arg Pro Arg Ser Leu 
115 120 125 

att gac ggc aaa egg atg gac aaa gtg ate tgg ggg cca aac tgg gaa 432 
He Asp Gly Lys Arg Met Asp Lys Val He Trp Gly Pro Asn Trp Glu 
130 135 140 

gaa ctg ctg ggc ggt gag ttc gaa aaa cgt gec cgc gac cgc aac ttc 480 
Glu Leu Leu Gly Gly Glu Phe Glu Lys Arg Ala Arg Asp Arg Asn Phe 
145 150 155 160 

gag gec atg caa aag gag atg tac ggg cag ttt gaa aac acc ttc atg 528 
Glu Ala Met Gin Lys Glu Met Tyr Gly Gin Phe Glu Asn Thr Phe Met 
165 170 175 

atg tac ctg ccg cgc ctg tgc gaa cac tgc etc aat ccc age tgc gtg 576 
Met Tyr Leu Pro Arg Leu Cys Glu His Cys Leu Asn Pro Ser Cys Val 
180 185 190 

gcg acc tgc cca age ggc get ate tac aaa cgc gaa gaa gac ggc att 624 
Ala Thr Cys Pro Ser Gly Ala He Tyr Lys Arg Glu Glu Asp Gly He 
195 200 205 

gtg ctg att gat cag gat aaa tgc cgt ggc tgg cgt ttg tgc ata age 672 
Val Leu He Asp Gin Asp Lys Cys Arg Gly Trp Arg Leu Cys He Ser 
210 215 220 

ggt tgt ccg tac aaa aaa ate tac ttc aac tgg aaa age ggc aag tea 720 
Gly Cys Pro Tyr Lys Lys He Tyr Phe Asn Trp Lys Ser Gly Lys Ser 
225 230 235 240 

gaa aaa tgc ate ttc tgt tac cca cga att gag tec ggt caa ccg acc 768 
Glu Lys Cys He Phe Cys Tyr Pro Arg He Glu Ser Gly Gin Pro Thr 
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245 250 255 

gtg tgc tea gaa acc tgc gtg ggt cgc ate egg tat ctg ggc gtg ctg 816 
Val Cys Ser Glu Thr Cys Val Gly Arg lie Arg Tyr Leu Gly Val Leu 
260 265 270 

ctt tac gac gee gac cgc ate gag gaa gcg gcg age acc gag cgc gaa 864 
Leu Tyr Asp Ala Asp Arg lie Glu Glu Ala Ala Ser Thr Glu Arg Glu 
275 280 285 

gtt gac etc tat gaa cgc cag tgc gaa gtg ttc etc gat cca cac gat 912 
Val Asp Leu Tyr Glu Arg Gin Cys Glu Val Phe Leu Asp Pro His Asp 
290 295 300 

ccc tea gtg ate gag gaa gee ctg aaa caa ggt att cca caa aat gtg 960 
Pro Ser Val lie Glu Glu Ala Leu Lys Gin Gly lie Pro Gin Asn Val 
305 310 315 320 

att gac get gee cag egt teg cca gtc tac aaa atg gcg atg gac tgg 1008 
lie Asp Ala Ala Gin Arg Ser Pro Val Tyr Lys Met Ala Met Asp Trp 
325 330 335 

aaa ctg gca eta ccg ttg cac cct gaa tac cgc acc ctg ccg atg gtc 1056 
Lys Leu Ala Leu Pro Leu His Pro Glu Tyr Arg Thr Leu Pro Met Val 
340 345 350 

tgg tac gtt cct ccg ctg tea ccg att cag tec tac gca gat gcg ggc 1104 
Trp Tyr Val Pro Pro Leu Ser Pro lie Gin Ser Tyr Ala Asp Ala Gly 
355 360 365 

ggt ttg ccg aaa age gaa ggc gtg ctg ccc gee ate gaa age ctg cgt 1152 
Gly Leu Pro Lys Ser Glu Gly Val Leu Pro Ala He Glu Ser Leu Arg 
370 375 380 

att ccg gtg caa tat etc gee aat atg ttg agt gec ggc gat acc ggt 1200 
He Pro Val Gin Tyr Leu Ala Asn Met Leu Ser Ala Gly Asp Thr Gly 
385 390 395 400 

ccg gta ctg egg gcg ctg aaa egg atg atg gcg atg cgc cac tat atg 1248 
Pro Val Leu Arg Ala Leu Lys Arg Met Met Ala Met Arg His Tyr Met 
405 410 415 

cgt tea caa acc gtg gaa ggc gtt act gat act cgt gec ate gac gaa 1296 
Arg Ser Gin Thr Val Glu Gly Val Thr Asp Thr Arg Ala He Asp Glu 
420 425 430 

gta ggc ctg age gtc gee cag gtc gaa gag atg tat cgc tat etc gec 1344 
Val Gly Leu Ser Val Ala Gin Val Glu Glu Met Tyr Arg Tyr Leu Ala 
435 440 445 

att gee aac tat gaa gat cgt ttt gtt ate ccg acc age cat egg gaa 1392 
He Ala Asn Tyr Glu Asp Arg Phe Val He Pro Thr Ser His Arg Glu 
450 455 460 

atg gcg ggc gat gee ttc gca gaa cgc aac ggc tgc ggt ttt acc ttt 1440 
Met Ala Gly Asp Ala Phe Ala Glu Arg Asn Gly Cys Gly Phe Thr Phe 
465 470 475 480 

ggc gac ggt tgc cac ggt teg gac agt aaa ttc aac ctg ttc aac agt 1488 
Gly Asp Gly Cys His Gly Ser Asp Ser Lys Phe Asn Leu Phe Asn Ser 
485 490 495 
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age cgt ate gat gec ate aac ate ace gaa gtg cgc gac aaa gcg gag 1536 
Ser Arg He Asp Ala He Asn He Thr Glu Val Arg Asp Lys Ala Glu 
500 505 510 

ggc gaa taa 1545 
Gly Glu * 



<210> 156 
<211> 3741 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (3741) 

<400> 156 

atg agt aaa ctt ttg gat cgc ttt cgc tac ttc aaa caa aag ggc gaa 48 

Met Ser Lys Leu Leu Asp Arg Phe Arg Tyr Phe Lys Gin Lys Gly Glu 
1 5 10 15 ■ 

acc ttt gec gat ggt cac gga cag gtg atg cat age aac cgc gac tgg 96 
Thr Phe Ala Asp Gly His Gly Gin Val Met His Ser Asn Arg Asp Trp 
20 25 30 

gag gac age tat cgc cag cgt tgg cag ttc gac aaa ate gtg cgt tec 144 
Glu Asp Ser Tyr Arg Gin Arg Trp Gin Phe Asp Lys He Val Arg Ser 
35 40 45 

acc cac ggt gtt aac tgt aca ggc tec tgt age tgg aaa ate tac gtt 192 
Thr His Gly Val Asn Cys Thr Gly Ser Cys Ser Trp Lys He Tyr Val 
50 55 60 

aaa aat ggt ctg gtg acc tgg gaa ate caa cag acc gac tac ccg cgc 240 
Lys Asn Gly Leu Val Thr Trp Glu He Gin Gin Thr Asp Tyr Pro Arg 
65 70 75 80 

act cgc cct gac ctg ccc aat cat gaa cct cgc ggc tgc ccg cgt ggc 288 
Thr Arg Pro Asp Leu Pro Asn His Glu Pro Arg Gly Cys Pro Arg Gly 
85 90 95 

gca agt tac tec tgg tat ctt tac age get aac cgc ctg aaa tac ccg 336 
Ala Ser Tyr Ser Trp Tyr Leu Tyr Ser Ala Asn Arg Leu Lys Tyr Pro 
100 105 110 

etc att cgt aaa cga ctg att gaa ctg tgg cgc gaa gee etc aag caa 384 
Leu He Arg Lys Arg Leu He Glu Leu Trp Arg Glu Ala Leu Lys Gin 
115 120 125 

cac age gat ccg gta ctg gcg tgg gca teg att atg aac gat ccg caa 432 
His Ser Asp Pro Val Leu Ala Trp Ala Ser He Met Asn Asp Pro Gin 
130 135 140 

aag tgc ctg age tac aaa caa gtg cgt ggg cgc ggc ggg ttt ate cgc 480 
Lys Cys Leu Ser Tyr Lys Gin Val Arg Gly Arg Gly Gly Phe He Arg 
145 150 155 160 

tec aac tgg cag gaa eta aac cag ctg att gee gee get aac gtc tgg 528 
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Ser Asn Trp Gin Glu Leu Asn Gin Leu lie Ala Ala Ala Asn Val Trp 
165 170 175 

acc ate aaa acc tac ggc ccg gat cgc gtt gec ggt ttc teg ccg ate 576 
Thr lie Lye Thr Tyr Gly Pro Asp Arg Val Ala Gly Phe Ser Pro lie 
180 185 190 

ccg gcg atg teg atg gtt tct tac gee gee gga acg cgt tat ctg teg 624 
Pro Ala Met Ser Met Val Ser Tyr Ala Ala Gly Thr Arg Tyr Leu Ser 
195 200 205 

ctg ctt ggc ggc acc tgt tta agt ttc tac gac tgg tat tgc gac ctg 672 
Leu Leu Gly Gly Thr Cys Leu Ser Phe Tyr Asp Trp Tyr Cys Asp Leu 
210 215 220 

ccg ccc gec teg ccg atg acc tgg ggc gag caa acc gac gta ccg gaa 720 
Pro Pro Ala Ser Pro Met Thr Trp Gly Glu Gin Thr Asp Val Pro Glu 
225 230 235 240 

tct gec gac tgg tat aac tec age tac ate ate gec tgg ggg tct aac 768 
Ser Ala Asp Trp Tyr Asn Ser Ser Tyr lie lie Ala Trp Gly Ser Asn 
245 250 255 

gta ccg cag aca cgt acg ccg gac gee cac ttc ttt acc gaa gta cgc 816 
Val 'pro Gin Thr Arg Thr Pro Asp Ala His Phe Phe Thr Glu Val Arg 
260 265 270 

tac aaa ggc act aaa acc ate gec att acc cct gac tac tct gaa gtg 864 
Tyr Lys Gly Thr Lys Thr lie Ala lie Thr Pro Asp Tyr Ser Glu Val 
275 280 285 

gec aaa ttg tgc gac cag tgg ctg gca ccg aaa caa ggc act gat age 912 
Ala Lys Leu Cys Asp Gin Trp Leu Ala Pro Lys Gin Gly Thr Asp Ser 
290 295 300 

gec ctg gcg atg gca atg ggc cat gtg att tta aaa gag ttt cat etc 960 
Ala Leu Ala Met Ala Met Gly His Val lie Leu Lys Glu Phe His Leu 
305 310 315 320 

gat aat ccc age gac tac ttt ate aac tac tgc cgc cgc tac age gac 1008 
Asp Asn Pro Ser Asp Tyr Phe lie Asn Tyr Cys Arg Arg Tyr Ser Asp 
325 330 335 

atg ccg atg ctg gta atg ctg gag cct cgc gac gat ggt age tac gtt 1056 
Met Pro Met Leu Val Met Leu Glu Pro Arg Asp Asp Gly Ser Tyr Val 
340 345 350 

ccc ggg cgc atg ate cgc gca tct gac ctg gtg gat gga ctg ggc gaa 1104 
Pro Gly Arg Met He Arg Ala Ser Asp Leu Val Asp Gly Leu Gly Glu 
355 360 365 

age aac aat ccg cag tgg aaa acc gta gca gtt aat acc gca ggt gaa 1152 
Ser Asn Asn Pro Gin Trp Lys Thr Val Ala Val Asn Thr Ala Gly Glu 
370 375 380 

ttg gta gtg ccg aac ggt teg att ggt ttc cgc tgg gga gaa aaa ggc 1200 
Leu Val Val Pro Asn Gly Ser He Gly Phe Arg Trp Gly Glu Lys Gly 
385 390 395 400 

aaa tgg aat ctg gaa tec att gee gee ggt acg gaa acc gaa ttg teg 1248 
Lys Trp Asn Leu Glu Ser He Ala Ala Gly Thr Glu Thr Glu Leu Ser 
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405 410 415 

tta acc ctg etc ggt caa cat gac get gtt gca ggc gtg gec ttc ccc 1296 
Leu Thr Leu Leu Gly Gin His Asp Ala Val Ala Gly Val Ala Phe Pro 
420 *" 425 430 

tac ttt ggc ggc att gaa aat cog cat ttt cgc age gta aaa- cac aat 1344 
Tyr Phe Gly Gly lie Glu Asn Pro His Phe Arg Ser Val Lys His Asn 
435 440 445 

ccg gtg ctg gtg cgc caa ttg ccc gtt aaa aac ctg acg tta gtc gat 1392 
Pro Val Leu Val Arg Gin Leu Pro Val Lys Asn Leu Thr Leu Val Asp 
450 ~ 455 460 

ggc aac acc tgt ccg gtg gtc age gtt tat gat ttg gta ctg gcg aat 1440 
Gly Asn Thr Cys Pro Val Val Ser Val Tyr Asp Leu Val Leu Ala Asn 
465 470 475 480 

tac ggc etc gat cgc ggg ctg gaa gat gaa aac agt gcg aaa gat tac 1488 
Tyr Gly Leu Asp Arg Gly Leu Glu Asp Glu Asn Ser Ala Lys Asp Tyr 
485 490 495 

get gaa ate aaa ccg tac acc cca gee tgg ggt gag caa att acc ggc 1536 
Ala Glu lie Lys Pro Tyr Thr Pro Ala Trp Gly Glu Gin lie Thr Gly 
500 505 510 

gtg ccg cgc cag tat att gaa acc ate gec cgt gaa ttt gee gat acc 1584 
Val Pro Arg Gin Tyr lie Glu Thr lie Ala Arg Glu Phe Ala Asp Thr 
515 520 ~ 525 

gec cat aaa acg cat ggg cgc teg atg att ate etc ggc gca ggt gtt 1632 
Ala His Lys Thr His Gly Arg Ser Met lie He Leu Gly Ala Gly Val 
530 535 540 

aac cac tgg tat cac atg gac atg aac tac cgc ggg atg ate aat atg 1680 
Asn His Trp Tyr His Met Asp Met Asn Tyr Arg Gly Met He Asn Met 
545 550 555 560 

ctg ate ttc tgc ggt tgt gtc ggg caa age ggt ggc ggc tgg gca cac 1728 
Leu lie Phe Cys Gly Cys Val Gly Gin Ser Gly Gly Gly Trp Ala His 
565 570 575 

tat gtc ggc cag gaa aaa ctg cgc cca caa acc ggc tgg ttg cca ctg 1776 
Tyr Val Gly Gin Glu Lys Leu Arg Pro Gin Thr Gly Trp Leu Pro Leu 
580 585 590 

gee ttt gcg etc gac tgg aac cga cca ccg cgc caa atg aac age acc 1824 
Ala Phe Ala Leu Asp Trp Asn Arg Pro Pro Arg Gin Met Asn Ser Thr 
595 600 605 

teg ttt ttc tac aat cat tec age caa tgg cgc tat gaa aaa gtc tct 1872 
Ser Phe Phe Tyr Asn His Ser Ser Gin Trp Arg Tyr Glu Lys Val Ser 
610 615 620 

get cag gag tta ctt tea ccg etc gee gat gee agt aag tac age ggt 1920 
Ala Gin Glu Leu Leu Ser Pro Leu Ala Asp Ala Ser Lys Tyr Ser Gly 
625 630 635 640 

cat ctg att gat ttc aac gtt cgc gee gaa cgt atg ggc tgg eta cct 1968 
His Leu He Asp Phe Asn Val Arg Ala Glu Arg Met Gly Trp Leu Pro 
645 650 655 
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tct gcg ccg cag ctg ggg cgt aac ccg etc ggg att aaa get gaa gec 2016 
Ser Ala Pro Gin Leu Gly Arg Asn Pro Leu Gly He Lys Ala Glu Ala 
660 665 670 

gac aag gec gga tta tec ccc aca gaa ttt ace gee cag gcg ctg aaa 2064 
Asp Lys Ala Gly Leu Ser Pro Thr Glu Phe Thr Ala Gin Ala Leu Lys 
675 680 685 

teg ggc gat tta cgt atg gee tgc gaa caa cca gat age age age aat 2112 
Ser Gly Asp Leu Arg Met Ala Cys Glu Gin Pro Asp Ser Ser Ser Asn 
690 695 700 

cat ccg cgt aat ttg ttt gtc tgg cgc tct aac ctg ctt ggc tec tec 2160 
His Pro Arg Asn Leu Phe Val Trp Arg Ser Asn Leu Leu Gly Ser Ser 
705 710 715 720 

ggc aaa ggc cac gag tat atg cag aag tat ctg ctg ggg ace gaa age 2208 
Gly Lys Gly His Glu Tyr Met Gin Lys Tyr Leu Leu Gly Thr Glu Ser 
725 730 735 

ggg att cag ggc gag gaa etc ggt gee age gac ggg ate aaa ccg gaa 2256 
Gly He Gin Gly Glu Glu Leu Gly Ala Ser Asp Gly He Lys Pro Glu 
740 745 750 

gaa gtc gag tgg caa act gca gcg att gaa ggc aag etc gac ctg ctg 2304 
Glu Val Glu Trp Gin Thr Ala Ala He Glu Gly Lys Leu Asp Leu Leu 
755 760 765 

gtg acg etc gac ttc cgc atg tec agt ace tgc ctg ttc tec gat ate 2352 
Val Thr Leu Asp Phe Arg Met Ser Ser Thr Cys Leu Phe Ser Asp He 
770 775 780 

gtt ctg ccc acc gee ace tgg tac gaa aaa gac gat atg aac acc teg 2400 
Val Leu Pro Thr Ala Thr Trp Tyr Glu Lys Asp Asp Met Asn Thr Ser 
785 790 795 800 

gat atg cat ccg ttt att cat ccg ctt tct gcg gcg gtc gat ccg gcg 2448 
Asp Met His Pro Phe He His Pro Leu Ser Ala Ala Val Asp Pro Ala 
805 810 815 

t^ST gaa tea cgc age gac tgg gaa ate tac aaa ggt ate gee aaa gca 2496 
Trp Glu Ser Arg Ser Asp Trp Glu He Tyr Lys Gly He Ala Lys Ala 
820 825 830 

ttt teg caa gtg tgc gtg ggt cat ctt ggc aaa gaa acc gac gtg gta 2544 
Phe Ser Gin Val Cys Val Gly His Leu Gly Lys Glu Thr Asp Val Val 
835 840 845 

tta caa ccc eta ctg cat gac tct ccg gca gag etc tea cag ccg tgt 2592 
Leu Gin Pro Leu Leu His Asp Ser Pro Ala Glu Leu Ser Gin Pro Cys 
850 855 860 

gaa gtg etc gac tgg cgc aaa ggc gaa tgc gat ctg att cca ggc aaa 2640 
Glu Val Leu Asp Trp Arg Lys Gly Glu Cys Asp Leu He Pro Gly Lys 
865 870 875 880 

acc gcg ccg aat att gtg gcg gtg gag cgc gac tac cct get acg tat 2688 
Thr Ala Pro Asn He Val Ala . Val Glu Arg Asp Tyr Pro Ala Thr Tyr 
885 890 * 895 
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gaa cgc ttt acc teg etc ggg cca ttg atg gac aaa ctt ggc aat ggc 2736 
Glu Arg Phe Thr Ser Leu Gly Pro Leu Met Asp Lys Leu Gly Asn Gly 
900 905 910 

ggt aaa ggg att teg tgg aat acc cag gat gaa ate gat ttc etc ggc 2784 
Gly Lys Gly lie Ser Trp Asn Thr Gin Asp ' Glu lie Asp Phe Leu Gly 
915 920 925 

aaa etc aat tac acc aag cgt gat ggc ccg gca cag ggg cgt ccg ctg 2832 
Lys Leu Asn Tyr Thr Lys Arg Asp Gly Pro Ala Gin Gly Arg Pro Leu 
930 935 940 

att gac acc gec att gac get tea gaa gtg att ctg gca ctg gca ccg 2880 
lie Asp Thr Ala lie Asp Ala Ser Glu Val lie Leu Ala Leu Ala Pro 
945 950 955 960 

gaa acc aac ggt cat gtt gca gtc aaa gcg tgg cag gcg ctg ggc gag 2928 
Glu Thr Asn Gly His Val Ala Val Lys Ala Trp Gin Ala Leu Gly Glu 
965 970 975 

ate acc ggg cgc gaa cat acc cat ctg gcg ctg cac aaa gag gac gag 2976 
lie Thr Gly Arg Glu His Thr His Leu Ala Leu His Lys Glu Asp Glu 
980 985 990 

aag att cgc ttt cgc gat att cag gcg cag ccg cgt aaa att ate tec 3024 
Lys lie Arg Phe Arg Asp lie Gin Ala Gin Pro Arg Lys He He Ser 
995 1000 1005 

age ccc acc tgg tec ggt ctg gaa age gat cac gtc tec tat aac gcg 3072 
Ser Pro Thr Trp Ser Gly Leu Glu Ser Asp His Val Ser Tyr Asn Ala 
1010 1015 1020 

gga tac acc aac gtt cat gag tta att ccg tgg cgc acg ctg teg gga 3120 
Gly Tyr Thr Asn Val His Glu Leu He Pro Trp Arg Thr Leu Ser Gly 
1025 1030 1035 1040 

cgc cag cag etc tat cag gat cat ccg tgg atg cgt get ttt ggt gaa 3168 
Arg Gin Gin Leu Tyr Gin Asp His Pro Trp Met Arg Ala Phe Gly Glu 
1045 1050 1055 

age ctg gtg get tat cgc ccg cct ate gac acc cgt age gtc agt gag 3216 
Ser Leu Val Ala Tyr Arg Pro Pro He Asp Thr Arg Ser Val Ser Glu 
1060 1065 1070 

atg cgc cag ata ccg cca aac ggc ttc ccg gaa aaa gca ctt aac ttc 3264 
Met Arg Gin He Pro Pro Asn Gly Phe Pro Glu Lys Ala Leu Asn Phe 
1075 1080 1085 

ctg acg ccg cac cag aaa tgg ggc att cac tea acc tac agt gaa aac 3312 
Leu Thr Pro His Gin Lys Trp Gly He His Ser Thr Tyr Ser Glu Asn 
1090 1095 1100 

ctg eta atg ctg acg etc tct cgc ggt gga ccg att gtc tgg ate age 3360 
Leu Leu Met Leu Thr Leu Ser Arg Gly Gly Pro He Val Trp lie Ser 
1105 1110 1115 1120 

gaa aca gat gec cga gaa ctg acc att gtc gat aac gac tgg gtg gaa 3408 
Glu Thr Asp Ala Arg Glu Leu Thr He Val Asp Asn Asp Trp Val Glu 
1125 1130 1135 

gta ttc aac gec aac ggc gcg ctg act gec cgc gcg gtg gtc age caa 3456 
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Val Phe Asn Ala Asn Gly Ala Leu Thr Ala Arg Ala Val Val Ser Gin 
1140 1145 1150 

c £ft gta ccg ccg ggc atg acc atg atg tat cac gcc cag gaa cgc att 3504 
Arg Val Pro Pro Gly Met Thr Met Met Tyr His Ala Gin Glu Arg lie 
1155 1160 1165 

atg aat att cct ggt teg gaa gta act ggc atg cgc ggc ggc att cat 3552 
Met Asn lie Pro Gly Ser Glu Val Thr Gly Met Arg Gly Gly lie His 
1170 1175 1180 

aac teg gtt acc cgc gtt tgc ccg aaa cca acg cat atg att ggc ggt 3600 
Asn Ser Val Thr Arg Val Cys Pro Lys Pro Thr His Met lie Gly Gly 
1185 1190 1195 1200 

tac gcg cag ctg gcc tgg ggc ttt aac tac tac ggc acc gtc gga teg 3648 
Tyr Ala Gin Leu Ala Trp Gly Phe Asn Tyr Tyr Gly Thr Val Gly Ser 
1205 1210 1215 

aac cgc gat gag ttc ate atg ate cgc aag atg aag aac gtt aac tgg 3696 
Asn Arg Asp Glu Phe lie Met lie Arg Lys Met Lys Asn Val Asn Trp 
1220 1225 1230 

ctg gat gat gaa ggt cgc gat cag gta cag gag gcg aaa aaa tga 3741 
Leu Asp Asp Glu Gly Arg Asp Gin Val Gin Glu Ala Lys Lys * 
1235 1240 1245 



<210> 157 
<211> 498 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (498) 

<400> 157 

atg gat ttg tea cag eta aca cca cgt cgt ccc tat ctg ctg cgt gca 48 
Met Asp Leu Ser Gin Leu Thr Pro Arg Arg Pro Tyr Leu Leu Arg Ala 
15 10 15 

ttc tat gag tgg ttg ctg gat aac cag etc acg ccg cac ctg gtg gtg 96 
Phe Tyr Glu Trp Leu Leu Asp Asn Gin Leu Thr Pro His Leu Val Val 
20 25 30 

gat gtg acg etc cct ggc gtg cag gtt cct atg gaa tat gcg cgt gac 144 
Asp Val Thr Leu Pro Gly Val Gin Val Pro Met Glu Tyr Ala Arg Asp 
35 40 45 

ggg caa ate gta etc aac att gcg ccg cgt get gtc ggc aat ctg gaa 192 
Gly Gin He Val Leu Asn He Ala Pro Arg Ala Val Gly Asn Leu Glu 
50 55 60 

ctg gcg aat gat gag gtg cgc ttt aac gcg cgc ttt ggt ggc att ccg 240 
Leu Ala Asn Asp Glu Val Arg Phe Asn Ala Arg Phe Gly Gly He Pro 
65 70 75 80 

cgt cag gtt tct gtg ccg ctg get gcc gtg ctg get ate tac gcc cgt 288 
Arg Gin Val Ser Val Pro Leu Ala Ala Val Leu Ala He Tyr Ala Arg 
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85 90 95 

gaa aat ggc gca ggc acg atg ttt gag cct gaa get gec tac gat gaa 336 
Glu Asn Gly Ala Gly Thr Met Phe Glu Pro Glu Ala Ala Tyr Asp Glu 
100 105 110 

gat acc age ate atg aat gat gaa gag gca teg gca gac aac gaa acc 384 
Asp Thr Ser lie Met Asn Asp Glu Glu Ala Ser Ala Asp Asn Glu Thr 
115 120 125 

gtt atg teg gtt att gat ggc gac aag cca gat cac gat gat gac act 432 
Val Met Ser Val lie Asp Gly Asp Lys Pro Asp His Asp Asp Asp Thr 
130 135 140 

cat cct gac gat gaa cct ccg cag cca cca cgc ggt ggt cga ccg gca 480 
His Pro Asp Asp Glu Pro Pro Gin Pro Pro Arg Gly Gly Arg Pro Ala 
145 150 155 160 

tta cgc gtt gtg aag taa 498 
Leu Arg Val Val Lys * 
165 



<210> 158 
<211> 639 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (639) 

<400> 158 

atg get gtc get gee aac aaa cgt teg gta atg acg ctg ttt tec ggt 48 
Met Ala Val Ala Ala Asn Lys Arg Ser Val Met Thr Leu Phe Ser Gly 
15 10 15 

cct act gac ate tat age cat cag gtc cgc att gtg ctg get gag aaa 96 
Pro Thr Asp lie Tyr Ser His Gin Val Arg lie Val Leu Ala Glu Lys 
20 25 30 

9Tgt STta agt ttc gag ate gaa cac gtg gaa aag gac aat ccg cct cag 144 
Gly Val Ser Phe Glu lie Glu His Val Glu Lys Asp Asn Pro Pro Gin 
35 40 45 

gat ctg att gac etc aac ccg aat cag age gtt ccg acc ctg gtg gat 192 
Asp Leu lie Asp Leu Asn Pro Asn Gin Ser Val Pro Thr Leu Val Asp 
50 55 60 

cgt gag ctg acc ctg tgg gaa tct cgc ate att atg gaa tat ctg gat 240 
Arg Glu Leu Thr Leu Trp Glu Ser Arg lie lie Met Glu Tyr Leu Asp 
65 70 75 80 

gag cgt ttc ccg cat ccg cca ctg atg cct gtt tac ccg gta get cgc 288 
Glu Arg Phe Pro His Pro Pro Leu Met Pro Val Tyr Pro Val Ala Arg 
85 90 95 

ggt gaa age cgt ctg tac atg cat cgc ate gaa aaa gac tgg tac acg 336 
Gly Glu Ser Arg Leu Tyr Met His Arg lie Glu Lys Asp Trp Tyr Thr 
100 105 110 
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ctg atg aac acc ate ate aac ggt tea get tct gaa gca gat gee gca 384 
Leu Met Asn Thr lie lie Asn Gly Ser Ala Ser Glu Ala Asp Ala Ala 
115 120 125 

cgt aag caa ctg cgc gaa gaa ctg ctg gcg att gcg ccg gtc ttc ggt 432 
Arg Lys Gin Leu Arg Glu Glu Leu Leu Ala He Ala Pro Val Phe Gly 
130 135 140 

cag aag ccg tac ttc ctg age gat gag ttc age ctg gtc gat tgc tat 480 
Gin Lys Pro Tyr Phe Leu Ser Asp Glu Phe Ser Leu Val Asp Cys Tyr 
145 150 155 160 

ctt get ccg ctg ctg tgg cgt ctg ccg caa ctg ggc ate gag ttc age 528 
Leu Ala Pro Leu Leu Trp Arg Leu Pro Gin Leu Gly He Glu Phe Ser 
165 170 175 

ggc ccg ggt gcg aaa gag ctg aaa ggc tat atg acc cgc gtc ttt gag 576 
Gly Pro Gly Ala Lys Glu Leu Lys Gly Tyr Met Thr Arg Val Phe Glu 
180 185 190 

cgt gac tct ttc ctt get tct tta act gaa gca gaa cgt gaa atg cgt 624 
Arg Asp Ser Phe Leu Ala Ser Leu Thr Glu Ala Glu Arg Glu Met Arg 
195 200 205 

ctg ggc egg agt taa 639 
Leu Gly Arg Ser * 
210 



<210> 159 
<211> 1518 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
<222> (1)...(1518) 

<400> 159 

atg tct gaa caa cac 
Met Ser Glu Gin His 
1 5 

aat gaa ctg aaa acg 
Asn Glu Leu Lys Thr 
20 

ggg att gec ttc ccg 
Gly He Ala Phe Pro 
35 

ttg cac gca gaa ttc 
Leu His Ala Glu Phe 
50 

aac ate gaa gtc gec 
Asn He Glu Val Ala 
65 

ggt aaa gcg tct ttc 
Gly Lys Ala Ser Phe 



gca cag ggc get gac gcg gta gtc gat ctt aac 48 
Ala Gin Gly Ala Asp Ala Val Val Asp Leu Asn 
10 15 

cgt cgt gag aag ctg gcg aac ctg cgc gag cag 96 
Arg Arg Glu Lys Leu Ala Asn Leu Arg Glu Gin 
25 30 

aac gat ttc cgt cgc gat cat acc tct gac caa 144 
Asn Asp Phe Arg Arg Asp His Thr Ser Asp Gin 
40 45 

gac ggc aaa gag aac gaa gaa ctg gaa gcg ctg 192 
Asp Gly Lys Glu Asn Glu Glu Leu Glu Ala Leu 
55 60 

gtt get ggc cgc atg atg acc cgt cgt att atg 240 
Val Ala Gly Arg Met Met Thr Arg Arg He Met 
70 75 80 

gtt acc ctg cag gac gtt ggc ggt cgc att cag 288 
Val Thr Leu Gin Asp Val Gly Gly Arg He Gin 
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ctg tac gtt gcc cgt gac gat etc ccg gaa ggc gtt tat aac gag cag 336 
Leu Tyr Val Ala Arg Asp Asp Leu Pro Glu Gly Val Tyr Asn Glu Gin 
100 105 110 

ttc aaa aaa tgg gac etc ggc gac ate etc ggc gcg aaa ggt aag ctg 384 
Phe Lys Lys Trp Asp Leu Gly Asp lie Leu Gly Ala Lys Gly Lys Leu 
115 120 125 

ttc aaa acc aaa acc ggc gaa ctg tct ate cac tgc acc gag ttg cgt 432 
Phe Lys Thr Lys Thr Gly Glu Leu Ser lie His Cys Thr Glu Leu Arg 
130 135 140 

ctg ctg acc aaa gca ctg cgt ccg ctg ccg gat aaa ttc cac ggc ttg 480 
Leu Leu Thr Lys Ala Leu Arg Pro Leu Pro Asp Lys Phe His Gly Leu 
145 150 155 160 

cag gat cag gaa gcg cgc tat cgt cag cgt tat etc gat etc ate tec 528 
Gin Asp Gin Glu Ala Arg Tyr Arg Gin Arg Tyr Leu Asp Leu lie Ser 
165 170 175 

aac gat gaa tec cgc aac acc ttt aaa gtg cgc teg cag ate etc tct 576 
Asn Asp Glu Ser Arg Asn Thr Phe Lys Val Arg Ser Gin lie Leu Ser 
180 185 190 

ggt att cgc cag ttc atg gtg aac cgc ggc ttt atg gaa gtt gaa acg 624 
Gly He Arg Gin Phe Met Val Asn Arg Gly Phe Met Glu Val Glu Thr 
195 200 205 

ccg atg atg cag gtg ate cct ggc ggt gcc get gcg cgt ccg ttt ate 672 
Pro Met Met Gin Val He Pro Gly Gly Ala Ala Ala Arg Pro Phe He 
210 215 220 

acc cac cat aac gcg ctg gat etc gac atg tac ctg cgt ate gcg ccg 720 
Thr His His Asn Ala Leu Asp Leu Asp Met Tyr Leu Arg He Ala Pro 
225 230 235 240 

gaa ctg tac etc aag cgt ctg gtg gtt ggt ggc ttc gag cgt gta ttc 768 
Glu Leu Tyr Leu Lys Arg Leu Val Val Gly Gly Phe Glu Arg Val Phe 
245 250 255 

gaa ate aac cgt aac ttc cgt aac gaa ggt att tec gta cgt cat aac 816 
Glu He Asn Arg Asn Phe Arg Asn Glu Gly He Ser Val Arg His Asn 
260 265 270 

cca gag ttc acc atg atg gaa etc tac atg get tac gca gat tac aaa 864 
Pro Glu Phe Thr Met Met Glu Leu Tyr Met Ala Tyr Ala Asp Tyr Lys 
275 280 285 

gat ctg ate gag ctg acc gaa teg ctg ttc cgt act ctg gca cag gat 912 
Asp Leu He Glu Leu Thr Glu Ser Leu Phe Arg Thr Leu Ala Gin Asp 
290 295 300 

att etc ggt aag acg gaa gtg acc tac ggc gac gtg acg ctg gac ttc 960 
He Leu Gly Lys Thr Glu Val Thr Tyr Gly Asp Val Thr Leu Asp Phe 
305 310 315 320 

ggt aaa ccg ttc gaa aaa ctg acc atg cgt gaa gcg ate aag aaa tat 1008 
Gly Lys Pro Phe Glu Lys Leu Thr Met Arg Glu Ala He Lys Lys Tyr 
325 330 335 
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cgc ccg gaa acc gac atg gcg gat ctg gac aac ttc gac tct gcg aaa 1056 
Arg Pro Glu Thr Asp Met Ala Asp Leu Asp Asn Phe Asp Ser Ala Lys 
340 345 350 

gca att get gaa tct ate ggc ate cac gtt gag aag age tgg ggt ctg 1104 
Ala lie Ala Glu Ser He Gly He His Val Glu Lys Ser Trp Gly Leu 
355 360 365 

ggc cgt ate gtt acc gag ate ttc gaa gaa gtg gca gaa gca cat ctg 1152 
Gly Arg He Val Thr Glu He Phe Glu Glu Val Ala Glu Ala His Leu 
370 375 380 

att cag ccg acc ttc att act gaa tat ccg gca gaa gtt tct ccg ctg 1200 
He Gin Pro Thr Phe He Thr Glu Tyr Pro Ala Glu Val Ser Pro Leu 
385 390 395 400 

gcg cgt cgt aac gac gtt aac ccg gaa ate aca gac cgc ttt gag ttc 1248 
Ala Arg Arg Asn Asp Val Asn Pro Glu He Thr Asp Arg Phe Glu Phe 
405 410 415 

ttc att ggt ggt cgt gaa ate ggt aac ggc ttt age gag ctg aat gac 1296 
Phe He Gly Gly Arg Glu He Gly Asn Gly Phe Ser Glu Leu Asn Asp 
420 425 430 

gcg gaa gat cag gcg caa cgc ttc ctg gat cag gtt gee gcg aaa gac 1344 
Ala Glu Asp Gin Ala Gin Arg Phe Leu Asp Gin Val Ala Ala Lys Asp 
435 440 445 

gca ggt gac gac gaa gcg atg ttc tac gat gaa gat tac gtc acc gca 1392 
Ala Gly Asp Asp Glu Ala Met Phe Tyr Asp Glu Asp Tyr Val Thr Ala 
450 455 460 

ctg gaa cat ggc tta ccg ccg aca gca ggt ctg gga att ggt ate gac 1440 
Leu Glu His Gly Leu Pro Pro Thr Ala Gly Leu Gly He Gly He Asp 
465 470 475 480 

cgt atg gta atg ctg ttc acc aac age cat acc ate cgc gac gtt att 1488 
Arg Met Val Met Leu Phe Thr Asn Ser His Thr He Arg Asp Val He 
485 490 495 

ctg ttc ccg gcg atg cgt ccg gta aaa taa 1518 
Leu Phe Pro Ala Met Arg Pro Val Lys * 
500 505 



<210> 160 
<211> 1098 
<212> DMA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1098) 

<400> 160 

atg ttt gaa att aat ccg gta aat aat cgc att cag gac etc acg gaa 48 
Met Phe Glu He Asn Pro Val Asn Asn Arg He Gin Asp Leu Thr Glu 
15 10 15 

cgc tec gac gtt ctt agg ggg tat ctt gac tac gac gec aag aaa gag 96 



-106- 



WO 01/48209 



PCT/US00/34419 



Arg Ser Asp Val Leu Arg Gly Tyr Leu Asp Tyr Asp Ala Lys Lys Glu 
20 25 30 

cgt ctg gaa gaa gta aac gcc gag ctg gaa cag ccg gat gtc tgg aac 144 
Arg Leu Glu Glu Val Asn Ala Glu Leu Glu Gin Pro Asp Val Trp Asia 
35 40 45 

gaa ccc gaa cgc gca cag gcg ctg ggt aaa gag cgt tec tec etc gaa 192 
Glu Pro Glu Arg Ala Gin Ala Leu Gly Lys Glu Arg Ser Ser Leu Glu 
50 55 60 

gcc gtt gtc gac acc etc gac caa atg aaa cag ggg ctg gaa gat gtt 240 
Ala Val Val Asp Thr Leu Asp Gin Met Lys Gin Gly Leu Glu Asp Val 
65 70 75 80 

tct ggt ctg ctg gaa ctg get gta gaa get gac gac gaa gaa acc ttt 288 
Ser Gly Leu Leu Glu Leu Ala Val Glu Ala Asp Asp Glu Glu Thr Phe 
85 90 95 

aac gaa gcc gtt get gaa etc gac gcc ctg gaa gaa aaa ctg gcg cag 336 
Asn Glu Ala Val Ala Glu Leu Asp Ala Leu Glu Glu Lys Leu Ala Gin 
100 105 110 

ctt gag ttc cgc cgt atg ttc tct ggc gaa tat gac age gcc gac tgc 384 
Leu Glu Phe Arg Arg Met Phe Ser Gly Glu Tyr Asp Ser Ala Asp Cys 
115 120 125 

tac etc gat att cag gcg ggg tct ggc ggt acg gaa gca cag gac tgg 432 
Tyr Leu Asp lie Gin Ala Gly Ser Gly Gly Thr Glu Ala Gin Asp Trp 
130 135 140 

gcg age atg ctt gag cgt atg tat ctg cgc tgg gca gaa teg cgt ggt 480 
Ala Ser Met Leu Glu Arg Met Tyr Leu Arg Trp Ala Glu Ser Arg Gly 
145 150 155 160 

ttc aaa act gaa ate ate gaa gag teg gaa ggt gaa gtg gcg ggt att 528 
Phe Lys Thr Glu lie lie Glu Glu Ser Glu Gly Glu Val Ala Gly He 
165 170 175 

aaa tec gtg acg ate aaa ate tec ggc gat tac get tac ggc tgg ctg 576 
Lys Ser Val Thr He Lys He Ser Gly Asp Tyr Ala Tyr Gly Trp Leu 
180 185 190 

cgt aca gaa acc ggc gtt cac cgc ctg gtg cgt aaa age ccg ttt gac 624 
Arg Thr Glu Thr Gly Val His Arg Leu Val Arg Lys Ser Pro Phe Asp 
195 200 205 

/ 

tec ggc ggt cgt cgc cac acg teg ttc age tec gcg ttt gtt tat ccg 672 
Ser Gly Gly Arg Arg His Thr Ser Phe Ser Ser Ala Phe Val Tyr Pro 
210 215 220 

gaa gtt gat gat gat att gat ate gaa ate aac ccg gcg gat ctg cgc 720 
Glu Val Asp Asp Asp He Asp He Glu He Asn Pro Ala Asp Leu Arg 
225 230 235 240 

att gac gtt tat cgc acg tec ggc gcg ggc ggt cag cac gtt aac cgt 768 
He Asp Val Tyr Arg Thr Ser Gly Ala Gly Gly Gin His Val Asn Arg 
245 250 255 

acc gaa tct gcg gtg cgt att acc cac ate ccg acc ggg ate gtg acc 816 
Thr Glu Ser Ala Val Arg He Thr His He Pro Thr Gly He Val Thr 
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260 265 270 

cag tgc cag aac gac cgt tec cag cac aag aac aaa gat cag gec atg 864 
Gin Cys Gin Asn Asp Arg Ser Gin His Lys Asn Lys Asp Gin Ala Met 
275 280 285 

aag cag atg aaa gcg aag ctt tat gaa ctg gag atg cag aag aaa aat 912 
Lys Gin Met Lys Ala Lys Leu Tyr Glu Leu Glu Met Gin Lys Lys Asn 
290 295 300 

gec gag aaa cag gcg atg gaa gat aac aaa tec gac ate ggc tgg ggc 960 
Ala Glu Lys Gin Ala Met Glu Asp Asn Lys Ser Asp lie Gly Trp Gly 
305 310 315 320 

age cag att cgt tct tat gtc ctt gat gac tec cgc att aaa gat ctg 1008 
Ser Gin lie Arg Ser Tyr Val Leu Asp Asp Ser Arg lie Lys Asp Leu 
325 330 335 

cgc acc ggg gta gaa ace cgc aac acg cag gec gtg ctg gac ggc age 1056 
Arg Thr Gly Val Glu Thr Arg Asn Thr Gin Ala Val Leu Asp Gly Ser 
340 345 350 

ctg gat caa ttt ate gaa gca agt ttg aaa gca ggg tta tga 1098 
Leu Asp Gin Phe lie Glu Ala Ser Leu Lys Ala Gly Leu * 
355 360 365 



<210> 161 
<211> 1734 
<212> DMA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1734) 

<400> 161 

gtg aaa caa cag ata caa ctt cgt cgc cgt gaa gtc gat gaa acg gca 48 

Met Lys Gin Gin lie Gin Leu Arg Arg Arg Glu Val Asp Glu Thr Ala 
1 5 10 15 

gac ttg ccc get gaa ttg cct ccc ttg ctg cgc cgt tta tac gec age 96 
Asp Leu Pro Ala Glu Leu Pro Pro Leu Leu Arg Arg Leu Tyr Ala Ser 
20 25 30 

c 99 gga gta cgc agt gcg caa gaa ctg gaa cgc agt gtt aaa ggt atg 144 
Arg Gly Val Arg Ser Ala Gin Glu Leu Glu Arg Ser Val Lys Gly Met 
35 40 45 

ctg ccc tgg cag caa ctg age ggc gtc gaa aag gec gtt gag ate ctt 192 
Leu Pro Trp Gin Gin Leu Ser Gly Val Glu Lys Ala Val Glu lie Leu 
50 55 60 

tac aac get ttt cgc gaa gga acg egg att att gtg gtc ggt gat ttc 240 
Tyr Asn Ala Phe Arg Glu Gly Thr Arg lie lie Val Val Gly Asp Phe 
65 70 75 80 

gac gee gac ggc gcg acc age acg get eta age gtg ctg gcg atg cgc 288 
Asp Ala Asp Gly Ala Thr Ser Thr Ala Leu Ser Val Leu Ala Met Arg 
85 90 95 
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teg ctt ggt tgc age aat ate gac tac ctg gta cca aac cgt ttc gaa 336 
Ser Leu Gly Cys Ser Asn He Asp Tyr Leu Val Pro Asn Arg Phe Glu 
100 105 110 

gac ggt tac ggc tta age ccg gaa gtg gtc gat cag gee cat gee cgt 384 
Asp Gly Tyr Gly Leu Ser Pro Glu Val Val Asp Gin Ala His Ala Arg 
115 120 125 

ggc gcg cag tta att gtc acg gtg gat aac ggt att tec tec cat gcg 432 
Gly Ala Gin Leu He Val Thr Val Asp Asn Gly He Ser Ser His Ala 
130 135 140 

ggg gtt gag cac get cgc teg ttg ggc ate ccg gtt att gtt acc gat 480 
Gly Val Glu His Ala Arg Ser Leu Gly He Pro Val He Val Thr Asp 
145 150 155 160 

cac cat ttg cca ggc gac aca tta ccc gca gcg gaa gcg ate att aac 528 
His His Leu Pro Gly Asp Thr Leu Pro Ala Ala Glu Ala He He Asn 
165 170 175 

cct aac ttg cgc gac tgt aat ttc ccg teg aaa tea ctg gca ggc gtg 576 
Pro Asn Leu Arg Asp Cys Asn Phe Pro Ser Lys Ser Leu Ala Gly Val 
180 185 190 

ggt gtg gcg ttt tat ctg atg ctg gcg ctg cgc acc ttt ttg cgc gat 624 
Gly Val Ala Phe Tyr Leu Met Leu Ala Leu Arg Thr Phe Leu Arg Asp 
195 200 205 

cag ggc tgg ttt gat gag cgt aac ate gca att cct aac ctg gca gaa 672 
Gin Gly Trp Phe Asp Glu Arg Asn He Ala He Pro Asn Leu Ala Glu 
210 215 220 

ctg ctg gat ctg gtc gcg ctg ggg aca gtg gcg gac gtc gtg ccg ctg 720 
Leu Leu Asp Leu Val Ala Leu Gly Thr Val Ala Asp Val Val Pro Leu 
225 230 235 240 

gac get aat aat cgc att ctg acc tgg cag ggg atg agt cgc ate cga 768 
Asp Ala Asn Asn Arg He Leu Thr Trp Gin Gly Met Ser Arg He Arg 
245 250 255 

gee gga aag tgc cgt ccg ggg att aaa gcg ctg ctt gaa gtg gca aac 816 
Ala Gly Lys Cys Arg Pro Gly He Lys Ala Leu Leu Glu Val Ala Asn 
260 265 270 

cgt gat gca caa aaa etc gee gee age gat tta ggt ttt gcg ctg ggg 864 
Arg Asp Ala Gin Lys Leu Ala Ala Ser Asp Leu Gly Phe Ala Leu Gly 
275 280 285 

cca cgt etc aat get gee gga cga ctg gac gat atg tec gtc ggt gtg 912 
Pro Arg Leu Asn Ala Ala Gly Arg Leu Asp Asp Met Ser Val Gly Val 
290 295 300 

gcg ctg ttg ttg tgc gac aac ate ggc gaa gcg cgc gtg ctg gca aat 960 
Ala Leu Leu Leu Cys Asp Asn He Gly Glu Ala Arg Val Leu Ala Asn 
305 310 315 320 

gaa etc gat gcg eta aac cag acg cga aaa gag ate gaa caa gga atg 1008 
Glu Leu Asp Ala Leu Asn Gin Thr Arg Lys Glu He Glu Gin Gly Met 
325 330 335 
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caa att gaa gcc ctg acc ctg tgc gag aaa ctg gag cgc age cgt gac 1056 
Gin lie Glu Ala Leu Thr Leu Cys Glu Lys Leu Glu Arg Ser Arg Asp 
340 345 350 

acg eta ccc ggc ggg ctg gca atg tat cac ccc gaa tgg cat cag ggc 1104 
Thr Leu Pro Gly Gly Leu Ala Met Tyr His Pro Glu Trp His Gin Gly 
355 360 365 

gtt gtc ggt att ctg get teg cgc ate aaa gag cgt ttt cac cgt ccg 1152 
Val Val Gly He Leu Ala Ser Arg He Lys Glu Arg Phe His Arg Pro 
370 375 380 

gtt ate gcg ttt gcg cca gca ggt gac ggt acg ctg aaa ggt tec ggt 1200 
Val He Ala Phe Ala Pro Ala Gly Asp Gly Thr Leu Lys Gly Ser Gly 
385 390 395 400 

cgc tec att cag ggg ctg cat atg cgt gat gcg ctg gag cga tta gac 1248 
Arg Ser lie Gin Gly Leu His Met Arg Asp Ala Leu Glu Arg Leu Asp 
405 410 415 

aca etc tac cct ggc atg atg ctg aag ttt ggc ggt cat gcg atg gcg 1296 
Thr Leu Tyr Pro Gly Met Met Leu Lys Phe Gly Gly His Ala Met Ala 
420 425 430 

gcg ggt ttg teg ctg gaa gag gat aaa ttc aaa etc ttt caa caa egg 1344 
Ala Gly Leu Ser Leu Glu Glu Asp Lys Phe Lys Leu Phe Gin Gin Arg 
435 440 445 

ttt ggc gaa ctg gtt act gag tgg ctg gac cct teg eta ttg caa ggc 1392 
Phe Gly Glu Leu Val Thr Glu Trp Leu Asp Pro Ser Leu Leu Gin Gly 
450 455 460 

gaa gtg gta tea gac ggt ccg tta age ccg gcc gaa atg acc atg gaa 1440 
Glu Val Val Ser Asp Gly Pro Leu Ser Pro Ala Glu Met Thr Met Glu 
465 470 475 480 

gtg gcg cag ctg ctg cgc gat get ggc ccg tgg ggg cag atg ttc ccg 1488 
Val Ala Gin Leu Leu Arg Asp Ala Gly Pro Trp Gly Gin Met Phe Pro 
485 490 495 

gag ccg ctg ttt gac ggt cat ttc cgt ctg ctg caa cag egg ctg gtg 1536 
Glu Pro Leu Phe Asp Gly His Phe Arg Leu Leu Gin Gin Arg Leu Val 
500 505 510 

ggc gaa cgt cat ttg aag gtg atg gtc gaa ccg gtc ggc ggc ggt cca 1584 
Gly Glu Arg His Leu Lys Val Met Val Glu Pro Val Gly Gly Gly Pro 
515 520 525 

ctg ctg gat ggt att get ttt aat gtc gat acc gcc etc tgg ccg gat 1632 
Leu Leu Asp Gly He Ala Phe Asn Val Asp Thr Ala Leu Trp Pro Asp 
530 535 540 

aac ggc gtg cgc gaa gtg caa ctg get tat aag etc gat ate aac gag 1680 
Asn Gly Val Arg Glu Val Gin Leu Ala Tyr Lys Leu Asp He Asn Glu 
545 550 555 560 

ttt cgc ggc aac cgc age ctg caa att ate ate gac aat ate tgg cca 1728 
Phe Arg Gly Asn Arg Ser Leu Gin He He He Asp Asn He Trp Pro 
565 570 575 

att tag 1734 
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He * 



<210> 162 
<211> 711 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (711) 

<400> 162 

atg aag aaa ggt ttt atg ttg ttt act ttg tta gcg gcg ttt tea ggc 48 

Met Lys Lys Gly Phe Met Leu Phe Thr Leu Leu Ala Ala Phe Ser Gly 
1 5 10 15 

ttt get cag get gat gac gcg gca att caa caa acg tta gec aaa atg 96 
Phe Ala Gin Ala Asp Asp Ala Ala He Gin Gin Thr Leu Ala Lys Met 
20 25 30 

ggc ate aaa age age gat att cag ccc gcg cct gta get ggc atg aag 144 
Gly He Lys Ser Ser Asp He Gin Pro Ala Pro Val Ala Gly Met Lys 
35 40 45 

aca gtt ctg act aac age ggc gtg ttg tac ate acc gat gat ggt aaa 192 
Thr Val Leu Thr Asn Ser Gly Val Leu Tyr He Thr Asp Asp Gly Lys 
50 55 60 

cat ate att cag ggg cca atg tat gac gtt agt ggc acg get ccg gtc 240 
His He He Gin Gly Pro Met Tyr Asp Val Ser Gly Thr Ala Pro Val 
65 70 75 80 

aat gtc acc aat aag atg ctg tta aag cag ttg aat gcg ctt gaa aaa 288 
Asn Val Thr Asn Lys Met Leu Leu Lys Gin Leu Asn Ala Leu Glu Lys 
85 90 95 

gag atg ate gtt tat aaa gcg ccg cag gaa aaa cac gtc ate acc gtg 336 
Glu Met He Val Tyr Lys Ala Pro Gin Glu Lys His Val He Thr Val 
100 105 110 

ttt act gat att acc tgt ggt tac tgc cac aaa ctg cat gag caa atg 384 
Phe Thr Asp He Thr Cys Gly Tyr Cys His Lys Leu His Glu Gin Met 
115 120 125 

gca gac tac aac gcg ctg ggg ate acc gtg cgt tat ctt get ttc ccg 432 
Ala Asp Tyr Asn Ala Leu Gly He Thr Val Arg Tyr Leu Ala Phe Pro 
130 135 140 

cgc cag ggg ctg. gac age gat gca gag aaa gaa atg aaa get ate tgg 480 
Arg Gin Gly Leu Asp Ser Asp Ala Glu Lys Glu Met Lys Ala He Trp 
145 150 155 160 

tgt gcg aaa gat aaa aac aaa gcg ttt gat gat gtg atg gca ggt aaa 528 
Cys Ala Lys Asp Lys Asn Lys Ala Phe Asp Asp Val Met Ala Gly Lys 
165 170 175 

age gtc gca cca gec agt tgc gac gtg gat att gee gac cat tac gca 576 
Ser Val Ala Pro Ala Ser Cys Asp Val Asp He Ala Asp His Tyr Ala 
180 185 190 
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ctt ggc gtc cag ctt ggc gtt age ggt act ccg gca gtt gtg ctg age 624 
Leu Gly Val Gin Leu Gly Val Ser Gly Thr Pro Ala Val. Val Leu Ser 
195 200 205 

aat ggc aca ctt gtt ccg ggt tac cag ccg ccg aaa gag atg aaa gaa 672 
Asn Gly Thr Leu Val Pro Gly Tyr Gin Pro Pro Lys Glu Met Lys Glu 
210 215 220 

ttc etc gac gaa cac caa aaa atg acc age ggt aaa taa 711 
Phe Leu Asp Glu His Gin Lys Met Thr Ser Gly Lys * 
225 230 235 



<210> 163 
<2ll> 897 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (897) 

<400> 163 

gtg aaa cag gat ctg gca cge ate gag cag ttt ctt gat get ctg tgg 48 
Met Lys Gin Asp Leu Ala Arg lie Glu Gin Phe Leu Asp Ala Leu Trp 
15 10 15 

ctg gaa aaa aat ctg get gaa aat acg ttg aac get tac cgt cge gat 96 
Leu Glu Lys Asn Leu Ala Glu Asn Thr Leu Asn Ala Tyr Arg Arg Asp 
20 25 30 

ctg tea atg atg gtg gag tgg ttg cat cac cge ggg ttg acg ctg gcg 144 
Leu Ser Met Met Val Glu Trp Leu His His Arg Gly Leu Thr Leu Ala 
35 '40 45 

acg gcg caa agt gac gat ttg cag gca tta ctg gca gaa egg ctg gag 192 
Thr Ala Gin Ser Asp Asp Leu Gin Ala Leu Leu Ala Glu Arg Leu Glu 
50 55 60 

ggc ggg tat aaa gee acc age tea gcg cgt ttg ctg agt gca gtg cge 240 
Gly Gly Tyr Lys Ala Thr Ser Ser Ala Arg Leu Leu Ser Ala Val Arg 
65 70 75 80 

cga ttg ttc cag tat ctt tat cge gaa aag ttt cgt gaa gac gat ccc 288 
Arg Leu Phe Gin Tyr Leu Tyr Arg Glu Lys Phe Arg Glu Asp Asp Pro 
85 90 95 

agt gcg cat etc get tea ccg aaa ttg ccc cag cgt ttg cca aaa gat 336 
Ser Ala His Leu Ala Ser Pro Lys Leu Pro Gin Arg Leu Pro Lys Asp 
100 105 110 



tta age gaa gcg cag gtc gaa cgt tta tta cag gca cca tta ate gat 384 
Leu Ser Glu Ala Gin Val Glu Arg Leu Leu Gin Ala Pro Leu lie Asp 
115 120 125 

cag cca ctg gag eta cge gat aaa gee atg ctt gaa gtg ttg tat get 432 
Gin Pro Leu Glu Leu Arg Asp Lys Ala Met Leu Glu Val Leu Tyr Ala 
130 135 140 

acc ggg ctg cgt gtc tct gaa ctg gtc gga ctg aca atg agt gat ate 480 
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Thr Gly Leu Arg Val Ser Glu Leu Val Gly Leu Thr Met Ser Asp He 
145 150 155 160 

age ctg cgt cag ggc gtg gta egg gtc att ggt aaa ggc aac aaa gag 528 
Ser Leu Arg Gin Gly Val Val Arg Val He Gly Lys Gly Asn Lys Glu 
165 170 175 

cgt ctg gtg ccg tta ggt gaa gag gcg gtt tac tgg ctg gaa acc tat 576 
Arg Leu Val Pro Leu Gly Glu Glu Ala Val Tyr Trp Leu Glu Thr Tyr 
180 185 190 

ctg gaa cat ggg cgt ccg tgg ctg ctg aat ggt gtg tea att gac gtg 624 
Leu Glu His Gly Arg Pro Trp Leu Leu Asn Gly Val Ser lie Asp Val 
195 200 205 

ttg ttt ccc age cag cgt gcg cag cag atg acg cga cag acc ttc tgg 672 
Leu Phe Pro Ser Gin Arg Ala Gin Gin Met Thr Arg Gin Thr Phe Trp 
210 215 220 

cac cgt att aaa cat tat get gtg ctg gcg ggt ate gac age gaa aag 720 
His Arg lie Lys His Tyr Ala Val Leu Ala Gly lie Asp Ser Glu Lys 
225 230 235 240 

ctg tea ccg cat gtg ttg cgt cac get ttt gee act cat tta tta aat 768 
Leu Ser Pro His Val Leu Arg His Ala Phe Ala Thr His Leu Leu Asn 
245 250 255 

cat ggt gcg gat tta cgc gtg gtg cag atg eta ctg ggc cac age gat 816 
His Gly Ala Asp Leu Arg Val Val Gin Met Leu Leu Gly His Ser Asp 
260 265 270 

etc tec acc acg caa att tat acg cat gtc get acc gag cgt ctg egg 864 
Leu Ser Thr Thr Gin lie Tyr Thr His Val Ala Thr Glu Arg Leu Arg 
275 280 285 

caa ctt cat caa cag cat cac ccg egg gcg tga 897 
Gin Leu His Gin Gin His His Pro Arg Ala * 
290 295 



<210> 164 
<211> 1104 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1104) 

<400> 164 

atg agt gac age cag acg ctg gtg gta aaa etc ggc acc agt gtg eta 48 
Met Ser Asp Ser Gin Thr Leu Val Val Lys Leu Gly Thr Ser Val Leu 
15 10 15 

aca ggc gga teg cgc cgt ctg aac cgt gee cat ate gtt gaa ctt gtt 96 
Thr Gly Gly Ser Arg Arg Leu Asn Arg Ala His He Val Glu Leu Val 
20 25 30 

cgc cag tgc gcg cag tta cat gec gee ggg cat egg att gtt att gtg 144 
Arg Gin Cys Ala Gin Leu His Ala Ala Gly His Arg He Val He Val 
35 40 45 
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acg teg ggc gcg ate gee gee gga cgt gag cac ctg ggt tac ccg gaa 192 
Thr Ser Gly Ala lie Ala Ala Gly. Arg Glu His Leu Gly Tyr Pro Glu 
50 55 60 

ctg cca gcg acc ate gec teg aaa caa ctg ctg gcg gcg gta ggg cag 240 
Leu Pro Ala Thr lie Ala Ser Lys Gin Leu Leu Ala Ala Val Gly Glu 
65 70 75 80 

agt cga ctg att caa ctg tgg gaa cag ctg ttt teg att tat ggc att 288 
Ser Arg Leu lie Gin Leu Trp Glu Gin Leu Phe Ser lie Tyr Gly He 
85 90 95 

cac gtc ggg caa atg ctg ctg acc cgt get gat atg gaa gac cgt gaa 336 
His Val Gly Gin Met Leu Leu Thr Arg Ala Asp Met Glu Asp Arg Glu 
100 105 110 

cgc ttc ctg aac gee cgc gac acc ctg cga gcg ttg etc gat aac aat 384 
Arg Phe Leu Asn Ala Arg Asp Thr Leu Arg Ala Leu Leu Asp Asn Asn 
115 120 125 

ate gtt ccg gta ate aat gag aac gat get gtc get acg gca gag att 432 
He Val Pro Val He Asn Glu Asn Asp Ala Val Ala Thr Ala Glu He 
130 135 140 

aag gtc ggc gat aac gat aac ctt tct gcg ctg gcg gcg att ctt gcg 48 0 
Lys Val Gly Asp Asn Asp Asn Leu Ser Ala Leu Ala Ala He Leu Ala 
145 150 155 160 

ggt gee gat aaa ctg ttg ctg ctg acc gat caa aaa ggt ttg tat acc 528 
Gly Ala Asp Lys Leu Leu Leu Leu Thr Asp Gin Lys Gly Leu Tyr Thr 
165 170 175 

get gac ccg cgc age aat ccg cag gca gaa ctg att aaa gat gtt tac 576 
Ala Asp Pro Arg Ser Asn Pro Gin Ala Glu Leu He Lys Asp Val Tyr 
180 185 190 

ggc att gat gac gca ctg cgc gcg att gee ggt gac age gtt tea ggc 624 
Gly He Asp Asp Ala Leu Arg Ala He Ala Gly Asp Ser Val Ser Gly 
195 200 * 205 

etc gga act ggc ggc atg agt acc aaa ttg cag gee get gac gtg get 672 
Leu Gly Thr Gly Gly Met Ser Thr Lys Leu Gin Ala Ala Asp Val Ala 
210 215 220 

tgc cgt gcg ggt ate gac acc att att gee gcg ggc age aag ccg ggc 720 
Cys Arg Ala Gly He Asp Thr He He Ala Ala Gly Ser Lys Pro Gly 
225 ~ 230 235 240 

gtt att ggt gat gtg atg gaa ggc att tec gtc ggt acg ctg ttc cat 768 
Val He Gly Asp Val Met Glu Gly He Ser Val Gly Thr Leu Phe His 
245 250 255 

gee cag gcg act ccg ctt gaa aac cgt aaa cgc tgg att ttc ggt gcg 816 
Ala Gin Ala Thr Pro Leu Glu Asn Arg Lys Arg Trp He Phe Gly Ala 
260 265 270 

ccg ccg gcg ggt gaa ate acg gta gat gaa ggg gca act gee gec att 864 
Pro Pro Ala Gly Glu He Thr Val Asp Glu Gly Ala Thr Ala Ala He 
275 280 285 
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ctg gaa cgc ggc age tec ctg ttg ccg aaa ggc att aaa age gtg act 912 
Leu Glu Arg Gly Ser Ser Leu Leu Pro Lys Gly He Lys Ser Val Thr 
290 295 300 

ggc aat ttc teg cgt ggt gaa gtc ate cgc att tgc aac etc gaa ggc 960 
Gly Asn Phe Ser Arg Gly Glu Val He Arg He Cys Asn Leu Glu Gly 
305 310 315 320 

cgc gat ate gec cac ggc gtc agt cgt tac aac age gat gca tta cgc 1008 
Arg Asp He Ala His Gly Val Ser Arg Tyr Asn Ser Asp Ala Leu Arg 
325 330 335 

cgt att gee gga cac cac teg caa gaa att gat gca ata ctg gga tat 1056 
Arg He Ala Gly His His Ser Gin Glu He Asp Ala He Leu Gly Tyr 
340 345 350 

gaa tac ggc ccg gtt gee gtt cac cgt gat gac atg att acc cgt taa 1104 
Glu Tyr Gly Pro Val Ala Val His Arg Asp Asp Met He Thr Arg * 
355 360 365 



<210> 165 
<211> 1254 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1254) 

<400> 165 

atg ctg gaa caa atg ggc att gee gcg aag caa gec teg tat aaa tta 48 
Met Leu Glu Gin Met Gly He Ala Ala Lys Gin Ala Ser Tyr Lys Leu 
1 5 10 15 

gcg caa etc tec age cgc gaa aaa aat cgc gtg ctg gaa aaa ate gec 96 
Ala Gin Leu Ser Ser Arg Glu Lys Asn Arg Val Leu Glu Lys He Ala 
20 25 30 

gat gaa ctg gaa gca caa age gaa ate ate etc aac get aac gee cag 144 
Asp Glu Leu Glu Ala Gin Ser Glu He He Leu Asn Ala Asn Ala Gin 
35 40 45 

gat gtt get gac gcg cga gec aat ggc ctt age gaa gcg atg ctt gac 192 
Asp Val Ala Asp Ala Arg Ala Asn Gly Leu Ser Glu Ala Met Leu Asp 
50 55 60 

cgt ctg gca ctg acg ccc gca egg ctg aaa ggc att gee gac gat gta 240 
Arg Leu Ala Leu Thr Pro Ala Arg Leu Lys Gly He Ala Asp Asp Val 
65 70 75 80 

cgt cag gtg tgc aac etc gec gat ccg gtg ggg cag gta ate gat ggc 288 
Arg Gin Val Cys Asn Leu Ala Asp Pro Val Gly Gin Val He Asp Gly 
85 90 95 

ggc gta ctg gac age ggc ctg cgt ctt gag cgt cgt cgc gta ccg ctg 336 
Gly Val Leu Asp Ser Gly Leu Arg Leu Glu Arg Arg Arg Val Pro Leu 
100 105 110 

ggg gtt att ggc gtg att tat gaa gcg cgc ccg aac gtg acg gtt gat 384 



-115- 



WO 01/48209 



PCT/US00/34419 



Gly Val lie Gly Val lie Tyr Glu Ala Arg Pro Asn Val Thr Val Asp 
115 120 125 

gtc get teg ctg tgc ctg aaa acc ggt aat gcg gtg ate ctg cgc ggt 432 
Val Ala Ser Leu Cys Leu Lys Thr Gly Asn Ala Val lie Leu Arg Gly 
130 135 140 

ggc aaa gaa acg tgt cgc act aac get gca acg gtg gcg gtg att cag 480 
Gly Lys Glu Thr Cys Arg Thr Asn Ala Ala Thr Val Ala Val lie Gin 
145 150 155 160 

gac gec ctg aaa tec tgc ggc tta ccg gcg ggt gee gtg cag gcg att 528 
Asp Ala Leu Lys Ser Cys Gly Leu Pro Ala Gly Ala Val Gin Ala lie 
165 170 175 

gat aat cct gac cgt gcg ctg gtc agt gaa atg ctg cgt atg gat aaa 576 
Asp Asn Pro Asp Arg Ala Leu Val Ser Glu Met Leu Arg Met Asp Lys 
180 185 190 

tac ate gac atg ctg ate ccg cgt ggt ggc get ggt ttg cat aaa ctg 624 
Tyr lie Asp Met Leu He Pro Arg Gly Gly Ala Gly Leu His Lys Leu 
195 200 205 

tgc cgt gaa cag teg aca ate ccg gtg ate aca ggt ggt ata ggc gta 672 
Cys Arg Glu Gin Ser Thr. He Pro Val He Thr Gly Gly He Gly Val 
210 215 220 

tgc cat att tac gtt gat gaa agt gta gag ate get gaa gca tta aaa 720 
Cys His He Tyr Val Asp Glu Ser Val Glu He Ala Glu Ala Leu Lys 
225 230 235 240 

gtg ate gtc aac gcg aaa act cag cgt ccg age aca tgt aat acg gtt 768 
Val He Val Asn Ala Lys Thr Gin Arg Pro Ser Thr Cys Asn Thr Val 
245 250 255 

gaa acg ttg ctg gtg aat aaa aac ate gee gat age ttc ctg ccc gca 816 
Glu Thr Leu Leu Val Asn Lys Asn He Ala Asp Ser Phe Leu Pro Ala 
260 265 270 

tta age aaa caa atg gcg gaa age ggc gtg aca tta cac gca gat gca 864 
Leu Ser Lys Gin Met Ala Glu Ser Gly Val Thr Leu His Ala Asp Ala 
275 280 285 

get gca ctg gcg cag ttg cag gca ggc cct gcg aag gtg gtt get gtt 912 
Ala Ala Leu Ala Gin Leu Gin Ala Gly Pro Ala Lys Val Val Ala Val 
290 295 300 

aaa gee gaa gag tat gac gat gag ttt ctg tea tta gat ttg aac gtc 960 
Lys Ala Glu Glu Tyr Asp Asp Glu Phe Leu Ser Leu Asp Leu Asn Val 
305 310 315 320 

aaa ate gtc age gat ctt gac gat gee ate gee cat att cgt gaa cac 1008 
Lys He Val Ser Asp Leu Asp Asp Ala He Ala His He Arg Glu His 
325 330 335 

ggc aca caa cac tec gat gcg ate ctg acc cgc gat atg cgc aac gee 1056 
Gly Thr Gin His Ser Asp Ala He Leu Thr Arg Asp Met Arg Asn Ala 
340 345 350 

cag cgt ttt gtt aac gaa gtg gat teg tec get gtt tac gtt aac gec 1104 
Gin Arg Phe Val Asn Glu Val Asp Ser Ser Ala Val Tyr Val Asn Ala 
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355 360 365 

tct acg cgt ttt acc gac ggc ggc cag ttt ggt ctg ggt gcg gaa gtg 1152 
Ser Thr Arg Phe Thr Asp Gly Gly Gin Phe Gly Leu Gly Ala Glu Val 
370 375 380 

gcg gta age aca caa aaa etc cac gcg cgt ggc cca atg ggg ctg gaa 1200 
Ala Val Ser Thr Gin Lys Leu His Ala Arg Gly Pro Met Gly Leu Glu 
385 390 395 400 

gca ctg acc act tac aag tgg ate ggc att ggt gat tac acc att cgt 1248 
Ala Leu Thr Thr Tyr Lys Tip He Gly He Gly Asp Tyr Thr lie Arg 
405 410 415 

gcg taa 1254 
Ala * 



<210> 166 
<211> 561 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (561) 

<400> 166 

atg atg aca agg cag gca tea atg aaa ggc ttc cca att gcg cat att 48 

Met Met Thr Arg Gin Ala Ser Met Lys Gly Phe Pro He Ala His He 
15 10 15 

ttt cac cct tea ate ccg cca atg cac gca gtg gtt aac aat cac aat 96 
Phe His Pro Ser He Pro Pro Met His Ala Val Val Asn Asn His Asn 
20 25 30 

aga aat att gat tat tgg acg gta aaa aga aag ttt gca gaa att gtc 144 
Arg Asn He Asp Tyr Trp Thr Val Lys Arg Lys Phe Ala Glu He Val 
35 40 45 

tec acc aat gac gtt aat aaa att tac agt at a agt aat gaa ctg egg 192 
Ser Thr Asn Asp Val Asn Lys He Tyr Ser He Ser Asn Glu Leu Arg 
50 55 60 

aga gta tta tct gca ata act gca ttg aat ttc tat cat ggc gat gtt 240 
Arg Val Leu Ser Ala He Thr Ala Leu Asn Phe Tyr His Gly ^sp Val 
65 70 75 80 

cct tct gtc atg ate cga ate caa ccg gaa aat atg agt cca ttc att 288 
Pro Ser Val Met He Arg He Gin Pro Glu Asn Met Ser Pro Phe He 
85 90 95 

ata gat att tct aca gga gaa cat gat gat tat ate ata caa aca tta 336 
He Asp He Ser Thr Gly Glu His Asp Asp Tyr He He Gin Thr Leu 
100 105 110 

gat gta ggc act ttt gca cct ttt ggt gaa caa tgt act tgc tea gee 384 
Asp Val Gly Thr Phe Ala Pro Phe Gly Glu Gin Cys Thr Cys Ser Ala 
115 120 125 
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gtc aat aaa aaa gag ctg gaa tgt att 
Val Asn Lys Lys Glu Leu Glu Cys He 
130 135 

tgt gca aaa ttc acc cga aaa gaa gcc 
Cys Ala Lys Phe Thr Arg Lys Glu Ala 
145 150 

ttt aat aaa acc age att act tea gat 
Phe Asn Lys Thr Ser He Thr Ser Asp 
165 

tea ccc gat cat ttt aat aat gat ttt 
Ser Pro Asp His Phe Asn Asn Asp Phe 
180 185 



aag gag acg att tct aag tat 432 
Lys Glu Thr He Ser Lys Tyr 
140 

ate tta acc ccc ctt gtg cac 480 
He Leu Thr Pro Leu Val His 
155 160 

tgt tgg caa att etc ttt ttt 528 
Cys Trp Gin He Leu Phe Phe 
170 175 

tat tga 561 
Tyr * 



<210> 167 
<211> 1188 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1188) 

<400> 167 

atg ttc cct gtt tea tea att ggt aat gat att agt agt gat tta gtc 48 

Met Phe Pro Val Ser Ser He Gly Asn Asp He Ser Ser Asp Leu Val 

1 5 10 15 

cgt cga aaa atg aat gac ctt cct gaa age cca aca gga aat aac etc 96 
Arg Arg Lys Met Asn Asp Leu Pro Glu Ser Pro Thr Gly Asn Asn Leu 
20 25 30 

gaa gca eta gca cca ggt ata gaa aaa eta aaa cag acc tct att gaa 144 
Glu Ala Leu Ala Pro Gly He Glu Lys Leu Lys Gin Thr Ser He Glu 
35 40 45 

atg gtc act tta ctt aac acg tta caa cct ggt gga aaa tgc att ate 192 
Met Val Thr Leu Leu Asn Thr Leu Gin Pro Gly Gly Lys Cys He He 
50 55 60 

act ggt gat ttt caa aaa gaa tta gcg tac tta caa aat gta att ctt 240 
Thr Gly Asp Phe Gin Lys Glu Leu Ala Tyr Leu Gin Asn Val He Leu 
65 70 75 80 

tat aat gtc teg tct ctt cgt ctg gat ttt tta ggt tat aac gcc caa 288 
Tyr Asn Val Ser Ser Leu Arg Leu Asp Phe Leu Gly Tyr Asn Ala Gin 
85 90 ' 95 

att att caa cga teg gac aat act tgt gaa ctt acc att aat gaa ccg 336 
He He Gin Arg Ser Asp Asn Thr Cys Glu Leu Thr He Asn Glu Pro 
100 105 110 

tta aaa aac cag gaa ata tec aca ggt aat ate aat att aat tgc cca 384 
Leu Lys Asn Gin Glu He Ser Thr Gly Asn He Asn He Asn Cys Pro 
115 120 125 

tta aaa gat att tac aat gaa ate agg agg tta aac gta att ttt agt 432 
Leu Lys Asp He Tyr Asn Glu He. Arg Arg Leu Asn Val He Phe Ser 
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130 135 140 

tgt ggg act gga gat ate gtt gat eta tec tct ctg gac tta cgt aat 480 
Cys Gly Thr Gly Asp lie Val Asp Leu Ser Ser Leu Asp Leu Arg Asn 
145 150 155 160 

gtc gat tta gat tat tat gat ttc aca gat aaa cat atg get aat act 528 
Val Asp Leu Asp Tyr Tyr Asp Phe Thr Asp Lys His Met Ala Asa Thr 
165 170 175 

att tta aat cct ttt aaa ttg aat tea aca aat ttt act aat gec aac 576 
lie Leu Asn Pro Phe Lys Leu Asn Ser Thr Asn Phe Thr Asn Ala Asn 
180 185 190 

atg ttt cag gtt aat ttt gtt agt tea aca caa aac gee aca ate tec 624 
Met Phe Gin Val Asn Phe Val Ser Ser Thr Gin Asn Ala Thr lie Ser 
195 200 205 

tgg gat tat tta eta aaa ata acg cct gtt tta ata age att age gat 672 
Trp Asp Tyr Leu Leu Lys lie Thr Pro Val Leu lie Ser lie Ser Asp 
210 215 220 

atg tat tct gaa gaa aaa ate aag ttt gtc gaa agt tgt tta aat gag 720 
Met Tyr Ser Glu Glu Lys lie Lys Phe Val Glu Ser Cys Leu Asn Glu 
225 , 230 235 240 

cct gga gac att acc gaa gaa caa tta aaa att atg aga ttt gca att 768 
Pro Gly Asp lie Thr Glu Glu Gin Leu Lys lie Met Arg Phe Ala lie 
245 250 255 

ata aaa tct ata cca agg gca act ctt aca gat aaa tta gaa aat gaa 816 
lie Lys Ser lie Pro Arg Ala Thr Leu Thr Asp Lys Leu Glu Asn Glu 
260 265 270 

tta aca aaa gaa ata tat aaa age tea teg aaa ate ate aat tgc ttg 864 
Leu Thr Lys Glu lie Tyr Lys Ser Ser Ser Lys lie lie Asn Cys Leu 
275 280 285 

aac aga att aaa tta aca gag atg aaa gaa ttc tea tea gaa aaa ata 912 
Asn Arg lie Lys Leu Thr Glu Met Lys Glu Phe Ser Ser Glu Lys lie 
290 295 300 

tat gat tac ate gat ata ate att gaa gat tat gaa aat acc aaa gaa 960 
Tyr Asp Tyr lie Asp lie lie He Glu Asp Tyr Glu Asn Thr Lys Glu 
305 " 310 315 320 

aat get tat ctg gtc gtc ccc caa att aat tat act atg gat tta aac 1008 
Asn Ala Tyr Leu Val Val Pro Gin He Asn Tyr Thr Met Asp Leu Asn 
325 330 335 

ata gaa gac tct age tea gaa gag tta ctt tea gat aat acc etc gag 1056 
He Glu Asp Ser Ser Ser Glu Glu Leu Leu Ser Asp Asn Thr Leu Glu 
340 345 350 

aaa gac gaa aat tct ccg gac aat ggc ttt gag gtc ggg gaa tat aac 1104 
Lys Asp Glu Asn Ser Pro Asp Asn Gly Phe Glu Val Gly Glu Tyr Asn 
355 * 360 365 

aca tat gaa gca tat aac tea gag aag caa tat ttt acc aga gag gac 1152 
Thr Tyr Glu Ala Tyr Asn Ser Glu Lys Gin Tyr Phe Thr Arg Glu Asp 
370 375 380 
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tat acg tat gat tac gac ctt tta aat gca ata tag 1188 
Tyr Thr Tyr Asp Tyr Asp Leu Leu Asn Ala lie * 
385 390 395 



<210> 168 
<211> 948 
<212> DMA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(948) 

<400> 168 

gtg tgt cat cgc gca ttt cga ctt cat ctt tgc aag gac tgg gtt ttc 48 

Met Cys His Arg Ala Phe Arg Leu His Leu Cys Lys Asp Trp Val Phe 

15 10 15 

atg ttt tct ggg ctg tta ate att ctg gtt ccc ctg att gtg ggt tac 96 
Met Phe Ser Gly Leu Leu lie lie Leu Val Pro Leu lie Val Gly Tyr 
20 25 30 

etc att ccg ctt cgc caa caa get gcg tta aaa gtt att aat cag eta 144 
Leu lie Pro Leu Arg Gin Gin Ala Ala Leu Lys Val He Asn Gin Leu 
35 40 45 

tta age tgg atg gtt tac ctt att etc ttt ttt atg ggt ate agt ctg 192 
Leu Ser Trp Met Val Tyr Leu He Leu Phe Phe Met Gly He Ser Leu 
50 55 60 

gcg ttt etc gat aac etc gee agt aac ctg ttg gcg att ctg cat tat 240 
Ala Phe Leu Asp Asn Leu Ala Ser Asn Leu Leu Ala He Leu His Tyr 
65 70 75 80 

tct gee gtc agt att acc gtt att tta ctg tgt aat att gee gee ctg 288 
Ser Ala Val Ser He Thr Val He Leu Leu Cys Asn He Ala Ala Leu 
85 90 95 

atg tgg ctg gag cga ggc ctg ccg tgg cgc aac cac cat cag caa gaa 336 
Met Trp Leu Glu Arg Gly Leu Pro Trp Arg Asn His His Gin Gin Glu 
100 105 110 

aaa etc ccg teg cgt att gcg atg gcg ctg gag teg eta aaa ctg tgc 384 
Lys Leu Pro Ser Arg He Ala Met Ala Leu Glu Ser Leu Lys Leu Cys 
115 120 125 

ggc gta gta gtg att ggt ttt gee att ggt eta agt gga ctg get ttc 432 
Gly Val Val Val He Gly Phe Ala He Gly Leu Ser Gly Leu Ala Phe 
130 135 140 

tta caa cac gcg acc gaa gec agt gaa tac acg tta att ttg eta ctt 480 
Leu Gin His Ala Thr Glu Ala Ser Glu Tyr Thr Leu He Leu Leu Leu 
145 150 155 160 

ttc etc gtt ggt att cag ttg cgc aat aat ggc atg acc tta aag cag 528 
Phe Leu Val Gly He Gin Leu Arg Asn Asn Gly Met Thr Leu Lys Gin 
165 170 175 

att gtc ctt aat cgc egg gga atg att gtc gec gtg gtg gtg gtt gtc 576 
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lie Val lieu Asn Arg Arg Gly Met lie Val Ala Val Val Val Val Val 
180 ~ 185 190 

agt tea tta att ggt ggt tta att aac gec ttt att ctt gat etc ccc 624 
Ser Ser Leu He Gly Gly Leu He Asn Ala Phe He Leu Asp Leu Pro 
195 200 205 

ate aat ace gcg ctg gca atg gee tec ggt ttc ggc tgg tat tct ctt 672 
He Asn Thr Ala Leu Ala Met Ala Ser Gly Phe Gly Tip Tyr Ser Leu 
210 215 220 

tec ggt att tta ttg acc gaa tct ttt ggt ccg gta ate ggg age gcg 720 
Ser Gly He Leu Leu Thr Glu Ser Phe Gly Pro Val He Gly Ser Ala 
225 230 235 240 

gcg ttt ttt aat gat ctg gee cgt gaa ctg att get att atg ttg ate 768 
Ala Phe Phe Asn Asp Leu Ala Arg Glu Leu He Ala He Met Leu He 
245 250 255 

cct ggg ctg att cgc cgc age cgc tct act gca ctg ggc tta tgc ggt 816 
Pro Gly Leu He Arg Arg Ser Arg Ser Thr Ala Leu Gly Leu Cys Gly 
260 265 270 

gee aca tea atg gat ttc acc ctg ccc gtt ctt caa cgt act ggc ggg 864 
Ala Thr Ser Met Asp Phe Thr Leu Pro Val Leu Gin Arg Thr Gly Gly 
275 280 285 

ctg gat atg gtc ccg gcg gca att gtt cac ggt ttt att ctt age ctg 912 
Leu Asp Met Val Pro Ala Ala He Val His Gly Phe He Leu Ser Leu 
290 295 300 

tta gtg ccg ate etc ate gee ttt ttc tct gcg taa 948 
Leu Val Pro He Leu He Ala Phe Phe Ser Ala * 
305 310 315 



<210> 169 
<211> 291 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (291) 

<400> 169 

atg caa aac aca act cat gac aac gta att ctg gag etc acc gtt cgc 48 
Met Gin Asn Thr Thr His Asp Asn Val He Leu Glu Leu Thr Val Arg 
1 5 - 10 15 

aac cat ccg ggc gta atg acc cac gtt tgt ggc ctt ttt gee cgc cgc 96 
Asn His Pro Gly Val Met Thr His Val Cys Gly Leu Phe Ala Arg Arg 
20 25 30 

get ttt aac gtt gaa ggc att ctt tgt ctg ccg att cag gac age gac 144 
Ala Phe Asn Val Glu Gly He Leu Cys Leu Pro He Gin Asp Ser Asp 
35 40 45 

aaa age cat ate tgg eta ctg gtc aat gac gac cag cgt ctg gag cag 192 
Lys Ser His He Trp Leu Leu Val Asn Asp Asp Gin Arg Leu Glu Gin 
50 " 55 60 
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atg ata age caa ate gat aag ctg gaa gat gtc gtg aaa gtg cag cgt 
Met lie Ser Gin He Asp Lys Leu Glu Asp Val Val Lys Val Gin Arg 
65 70 75 80 



240 



aat cag tec gat ccg acg atg ttt aac aag ate gcg gtg ttt ttt cag 288 
Asn Gin Ser Asp Pro Thr Met Phe Asn Lys He Ala Val Phe Phe Gin 
85 90 95 

taa 291 



<210> 170 
<211> 1689 
<212> DMA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1689) 

<400> 170 

atg gca agt teg ggc aca aca teg acg cgt aag cgc ttt acc ggc gca 48 
Met Ala Ser Ser Gly Thr Thr Ser Thr Arg Lys Arg Phe Thr Gly Ala 
15 10 15 

gaa ttt ate gtt cat ttc ctg gaa cag cag ggc att aag att gtg aca 96 
Glu Phe He Val His Phe Leu Glu Gin Gin Gly He Lys He Val Thr 
20 25 30 

ggc att ccg ggc ggt tct ate ctg cct gtt tac gat gee tta age caa 144 
Gly He Pro Gly Gly Ser He Leu Pro Val Tyr Asp Ala Leu Ser Gin 
35 40 45 

age acg caa ate cgc cat att ctg gee cgt cat gaa cag ggc gcg ggc 192 
Ser Thr Gin He Arg His He Leu Ala Arg His Glu Gin Gly Ala Gly 
50 55 60 

ttt ate get cag gga atg gcg cgc acc gac ggt aaa ccg gcg gtc tgt 240 
Phe He Ala Gin Gly Met Ala Arg Thr, Asp Gly Lys Pro Ala Val Cys 
65 70 75 80 

atg gee tgt age gga ccg ggt gcg act aac ctg gtg acc gee att gee 288 
Met Ala Cys Ser Gly Pro Gly Ala Thr Asn Leu Val Thr Ala He Ala 
85 90 95 

gat gcg egg ctg gac tec ate ccg ctg att tgc ate act ggt cag gtt 336 
Asp Ala Arg Leu Asp Ser He Pro Leu He Cys He Thr Gly Gin Val 
100 105 110 

ccc gee teg atg ate ggc acc gac gee ttc cag gaa gtg gac acc tac 384 
Pro Ala Ser Met He Gly Thr Asp Ala Phe Gin Glu Val Asp Thr Tyr 
115 120 125 

ggc ate tct ate ccc ate acc aaa cac aac tat ctg gtc aga cat ate 432 
Gly He Ser He Pro He Thr Lys His Asn Tyr Leu Val Arg His He 
130 135 140 

gaa gaa etc ccg cag gtc atg age gat gee ttc cgc att gcg caa tea 480 
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Glu Glu Leu Pro Gin Val Met Ser Asp Ala Phe Arg He Ala Gin Ser 
145 150 155 160 

ggc cgc cca ggc ccg gtg tgg ata gac att cct aag gat gtg caa acg 528 
Gly Arg Pro Gly Pro Val Trp He Asp He Pro Lys Asp Val Gin Thr 
165 170 175 

gca gtt ttt gag att gaa aca cag ccc get atg gca gaa aaa gec gec 576 
Ala Val Phe Glu He Glu Thr Gin Pro Ala Met Ala Glu Lys Ala Ala 
180 185 190 

gec ccc gec ttt age gaa gaa age att cgt gac gca gcg gcg atg att 624 
Ala Pro Ala Phe Ser Glu Glu Ser He Arg Asp Ala Ala Ala Met He 
195 200 205 

aac get gee aaa cgc ccg gtg ctt tat ctg ggc ggc ggt gtg ate aat 672 
Asn Ala Ala Lys Arg Pro Val Leu Tyr Leu Gly Gly Gly Val He Asn 
210 215 220 

gcg ccc gca egg gtg cgt gaa ctg gcg gag aaa gcg caa ctg cct ace 720 
Ala Pro Ala Arg Val Arg Glu Leu Ala Glu Lys Ala Gin Leu Pro Thr 
225 230 235 240 

ace atg act tta atg gcg ctg ggc atg ttg cca aaa gcg cat ccg ttg 768 
Thr Met Thr Leu Met Ala Leu Gly Met Leu Pro Lys Ala His Pro Leu 
245 250 255 

teg ctg ggt atg ctg ggg atg cac ggc gtg cgc age acc aac tat att 816 
Ser Leu Gly Met Leu Gly Met His Gly Val Arg Ser Thr Asn Tyr He 
260 265 270 

ttg cag gag gcg gat ttg ttg ata gtg etc ggt gcg cgt ttt gat gac 864 
Leu Gin Glu Ala Asp Leu Leu He Val Leu Gly Ala Arg Phe Asp Asp 
275 280 285 

egg gcg att ggc aaa acc gag cag ttc tgt ccg aat gec aaa ate att 912 
Arg Ala He Gly Lys Thr Glu Gin Phe Cys Pro Asn Ma Lys He He 
290 295 300 

cat gtc gat ate gac cgt gca gag ctg ggt aaa ate aag cag ccg cac 960 
His Val Asp He Asp Arg Ala Glu Leu Gly Lys He Lys Gin Pro. His 
305 310 315 320 

gtg gcg att cag gcg gat gtt gat gac gtg ctg gcg cag ttg ate ccg 1008 
Val Ala He Gin Ala Asp Val Asp Asp Val Leu Ala Gin Leu He Pro 
325 330 335 

ctg gtg gaa gcg caa ccg cgt gca gag tgg cac cag ttg gta gcg gat 1056 
Leu Val Glu Ala Gin Pro Arg Ala Glu Trp His Gin Leu Val Ala Asp 
340 345 350 . 

ttg cag cgt gag ttt ccg tgt cca ate ccg aaa gcg tgc gat ccg tta 1104 
Leu Gin Arg Glu Phe Pro Cys Pro He Pro Lys Ala Cys Asp Pro Leu 
355 360 365 

age cat tac ggc ctg ate aac gee gtt gec gec tgt gtc gat gac aat 1152 
Ser His Tyr Gly Leu He Asn Ala Val Ala Ala Cys Val Asp Asp Asn 
370 375 380 

gca att ate acc acc gac gtt ggt cag cat cag atg tgg acc gcg caa 1200 
Ala He He Thr Thr Asp Val Gly Gin His Gin Met Trp Thr Ala Gin 
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385 390 395 400 

get tat ccg etc aat cgc cca cgc cag tgg ctg acc tec ggt ggg ctg 1248 
Ala Tyr Pro Leu Asn Arg Pro Arg Gin Trp Leu Thr Ser Gly Gly Leu 
405 410 415 

ggc acg atg ggt ttt ggc ctg cct gcg gcg att ggc get gcg ctg gcg 1296 
Gly Thr Met Gly Phe Gly Leu Pro Ala Ala lie Gly Ala Ala Leu Ala 
420 425 430 

aac ccg gat cgc aaa gtg ttg tgt ttc tec ggc gac ggc age ctg atg 1344 
Asn Pro Asp Arg Lys Val Leu Cys Phe Ser Gly Asp Gly Ser Leu Met 
435 440 445 

atg aat att cag gag atg gcg acc gec agt gaa aat cag ctg gat gtc 1392 
Met Asn lie Gin Glu Met Ala Thr Ala Ser Glu Asn Gin Leu Asp Val 
450 455 460 

aaa ate att ctg atg aac aac gaa gcg ctg ggg ctg gtg cat cag caa 1440 
Lys lie lie Leu Met Asn Asn Glu Ala Leu Gly Leu Val His Gin Gin 
465 470 475 480 

cag agt ctg ttc tac gag caa ggc gtt ttt gee gec acc tat ccg ggc 1488 
Gin Ser Leu Phe Tyr Glu Gin Gly Val Phe Ala Ala Thr Tyr Pro Gly 
485 490 495 

aaa ate aac ttt atg cag att gee gec gga ttc ggc etc gaa acc tgt 1536 
Lys lie Asn Phe Met Gin lie Ala Ala Gly Phe Gly Leu Glu Thr Cys 
500 505 510 

gat ttg aat aac gaa gee gat ccg cag get tea ttg cag gaa ate ate 1584 
Asp Leu Asn Asn Glu Ala Asp Pro Gin Ala Ser Leu Gin Glu lie lie 
515 520 525 

aat cgc cct ggc ccg gcg ctg ate cat gtg cgc att gat gee gaa gaa 1632 
Asn Arg Pro Gly Pro Ala Leu lie His Val Arg He Asp Ala Glu Glu 
530 535 540 

aaa gtt tac ccg atg gtg ccg cca ggt gcg gcg aat act gaa atg gtg 1680 
Lys Val Tyr Pro Met Val Pro Pro Gly Ala Ala Asn Thr Glu Met Val 
545 550 555 560 

ggg gaa taa 1689 
Gly Glu * 



<210> 171 
<211> 99 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) ... (99) 

<400> 171 

atg act act tec atg etc aac gca aaa eta eta cca act gcg cca tec 48 
Met Thr Thr Ser Met Leu Asn Ala Lys Leu Leu Pro Thr Ala Pro Ser 
15 10 15 
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gcc gca gtg gtc gtc gtg cgt gtg gtg gtg gtc gtc ggc aat gcg ccg 
Ala Ala Val Val Val Val Arg Val Val Val Val Val Gly Asn Ala Pro 
20 25 30 



96 



tag 



99 



<210> 172 
<211> 1017 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1017) 

<400> 172 . 
atg ttt gtc ate tgg age cat aga aca ggg ttc ate atg agt cat caa 48 
Met Phe Val lie Trp Ser His Arg Thr Gly Phe lie Met Ser His Gin 
15 10 15 

ctt ace ttc gcc gac agt gaa ttc age agt aag cgc cgt cag acc aga 96 
Leu Thr Phe Ala Asp Ser Glu Phe Ser Ser Lys Arg Arg Gin Thr Arg 
20 25 30 

aaa gag att ttc ttg tec cgc atg gag cag att ctg cca tgg caa aac 144 
Lys Glu lie Phe Leu Ser Arg Met Glu Gin lie Leu Pro Trp Gin Asn 
35 40 45 

atg gtg gaa gtc ate gag ccg ttt tac ccc aag get ggt aat ggc egg 192 
Met Val Glu Val He Glu Pro Phe Tyr Pro Lys Ala Gly Asn Gly Arg 
50 55 60 

cga cct tat ccg ctg gaa acc atg eta cgc att cac tgc atg cag cat 240 
Arg Pro Tyr Pro Leu Glu Thr Met Leu Arg He His Cys Met Gin His 
65 70 75 80 

tgg tac aac ctg age gat ggc gcg atg gaa gat get ctg tac gaa ate 288 
Trp Tyr Asn Leu Ser Asp Gly Ala Met Glu Asp Ala Leu Tyr Glu He 
85 90 95 

gcc tec atg cgt ctg ttt gcc egg tta tec ctg gat age gcc ttg ccg 336 
Ala Ser Met Arg Leu Phe Ala Arg Leu Ser Leu Asp Ser Ala Leu Pro 
100 105 110 

gac cgc acc acc ate atg aat ttc cgc cac ctg ctg gag cag cat caa 384 
Asp Arg Thr Thr He Met Asn Phe Arg His Leu Leu Glu Gin His Gin 
115 120 125 

ctg gcc cgc caa ttg ttc aag acc ate aat cgc tgg ctg gcc gaa gca 432 
Leu Ala Arg Gin Leu Phe Lys Thr He Asn Arg Trp Leu Ala Glu Ala 
130 135 140 

ggc gtc atg atg act caa ggc acc ttg gtc gat gcc acc ate att gag 480 
Gly Val Met Met Thr Gin Gly Thr Leu Val Asp Ala Thr He He Glu 
145 150 155 160 

gca ccc age teg acc aag aac aaa gag cag caa cgc gat ccg gag atg 528 
Ala Pro Ser Ser Thr Lys Asn Lys Glu Gin Gin Arg Asp Pro Glu Met 



-125- 



WO 01/48209 



PCT/US00/34419 



165 170 175 

cat cag acc aag aaa ggc aat cag tgg cac ttt ggc atg aag gcc cac 576 
His Gin Thr Lys Lys Gly Asn Gin Trp His Phe Gly Met Lys Ala His 
180 185 190 

att ggt gtc gat gcc aag agt ggc ctg acc cac age ctg gtc acc acc 624 
He Gly Val Asp Ala Lys Ser Gly Leu Thr His Ser Leu Val Thr Thr 
195 200 205 

gcg gcc aac gag cat gac etc aat cag ctg ggt aat ctg ctg cat gga 672 
Ala Ala Asn Glu His Asp Leu Asn Gin Leu Gly Asn Leu Leu His Gly 
210 215 220 

gag gag caa ttt gtc tea gcc gat gcc ggc tac caa ggg gcg cca cag 720 
Glu Glu Gin Phe Val Ser Ala Asp Ala Gly Tyr Gin Gly Ala Pro Gin 
225 230 235 240 

cgc gag gag ctg gcc gag gtg gat gtg gac tgg ctg ate gcc gag cgc 768 
Arg Glu Glu Leu Ala Glu Val Asp Val Asp Trp Leu He Ala Glu Arg 
245 250 255 

ccc ggc aag gta aga acc ttg aaa cag cat cca cgc aag aac aaa acg 816 
Pro Gly Lys Val Arg Thr Leu Lys Gin His Pro Arg Lys Asn Lys Thr 
260 265 270 

gcc ate aac ate gaa tac atg aaa gcc age ate egg gcc agg gtg gag 864 
Ala He Asn He Glu Tyr Met Lys Ala Ser He Arg Ala Arg Val Glu 
275 280 285 

cac cca ttt cgc ate ate aag cga cag ttc ggc ttc gtg aaa gcc aga 912 
His Pro Phe Arg He He Lys Arg Gin Phe Gly Phe Val Lys Ala Arg 
290 295 300 

tac aag ggg ttg ctg aaa aac gat aac caa ctg gcg atg tta ttc acg 960 
Tyr Lys Gly Leu Leu Lys Asn Asp Asn Gin Leu Ala Met Leu Phe Thr 
305 310 315 320 

ctg gcc aac ctg ttt egg gcg gac caa atg ata cgt cag tgg gag aga 1008 
Leu Ala Asn Leu Phe Arg Ala Asp Gin Met He Arg Gin Trp Glu Arg 
325 330 335 

tct cac taa 1017 
Ser His * 



<210> 173 
<211> 474 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (474) 

<400> 173 

atg gta tat ata ata ate gtt tec cac gga cat gaa gac tac ate aaa 48 
Met Val Tyr He He He Val Ser His Gly His Glu Asp Tyr He Lys 
15 10 15 
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aaa tta etc gaa aat ctt aat get gac gat gag cac tac aag att ate 96 
Lys Leu Leu Glu Asn Leu Asn Ala Asp Asp Glu His Tyr Lys lie lie 
20 25 30 

gta cgc gac aac aaa gac tct eta tta ttg aaa caa ata tgc cag cat 144 
Val Arg Asp Asn Lys Asp Ser Leu Leu Leu Lys Gin lie Cys Gin His 
35 40 45 

tat gca ggc ctg gac tat att agt gga ggt gta tac ggc ttt ggt cat 192 
Tyr Ala Gly Leu Asp Tyr lie Ser Gly Gly Val Tyr Gly Phe Gly His 
50 55 60 

aat aat aat att gcg gtg gcg tat gta aag gaa aaa tat aga ccc gca 240 
Asn Asn Asn lie Ala Val Ala Tyr Val Lys Glu Lys Tyr Arg Pro Ala 
65 J 70 75 80 

gat gat gat tac att ttg ttt ttg aat ccc gat ate ate atg aag cat 288 
Asp Asp Asp Tyr lie Leu Phe Leu Asn Pro Asp lie He Met Lys His 
85 90 95 

gat gat ttg ctg aca tat att aaa tat gtc gaa agt aag cgt tat get 336 
Asp Asp Leu Leu Thr Tyr He Lys Tyr Val Glu Ser Lys Arg Tyr Ala 
100 105 110 

ttt agt aca tta tgc ctg ttc cga gat gaa gcg aaa tct tta cat gat 384 
Phe Ser Thr Leu Cys Leu Phe Arg Asp Glu Ala Lys Ser Leu His Asp 
115 120 125 

tat tec gta aga aaa ttt cct gtg ctt tct gat ttt att gtg tea ttt 432 
Tyr Ser Val Arg Lys Phe Pro Val Leu Ser Asp Phe He Val Ser Phe 
130 135 140 

atg tta ggg att aag gaa ggt gcg aac aag tec ctg ata tga 474 
Met Leu Gly He Lys Glu Gly Ala Asn Lys Ser Leu He * 
145 ~ 150 155 



<210> 174 
<211> 1119 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (1) . . . (1119) 
<400> 174 

atg gga aaa age ata gtc gtt gtt tct gcg gtc aat ttt acc act ggc 48 
Met Gly Lys Ser He Val Val Val Ser Ala Val Asn Phe Thr Thr Gly 
15 10 15 

ggt cca ttt acc att ttg aaa aaa ttt ttg gca gca act aat aat aaa 96 
Gly Pro Phe Thr He Leu Lys Lys Phe Leu Ala Ala Thr Asn Asn Lys 
20 25 30 

gaa aat gtc agt ttt ate gca tta gtc cat tct get aaa gag tta aaa 144 
Glu Asn Val Ser Phe He Ala Leu Val His Ser Ala Lys Glu Leu Lys 
35 40 45 

gaa agt tat cca tgg gtt aaa ttc att gag ttt cct gag gtt aaa ggg 192 
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Glu Ser Tyr Pro Trp Val Lys Phe He Glu Phe Pro Glu Val Lys Gly 
50 55 60 

teg tgg eta aaa cgt ttg cac ttt gaa tat gta gtt tgt aaa aaa ctt 240 
Ser Trp Leu Lys Arg Leu His Phe Glu Tyr Val Val Cys Lys Lys Leu 
65 70 75 80 

tea aaa gag ctg aat get acg cat tgg att tgt ctg cat gat att acg 288 
Ser Lys Glu Leu Asn Ala Thr His Trp lie Cys Leu His Asp He Thr 
85 90 95 

gec aat gtc gtc act aaa aaa aga tat gtg tat tgt cat aac cct gca 336 
Ala Asn Val Val Thr Lys Lys Arg Tyr Val Tyr Cys His Asn Pro Ala 
100 105 110 

cct ttt tat aaa gga att tta ttc cgt gaa att ctt atg gag cct age 384 
Pro Phe Tyr Lys Gly He Leu Phe Arg Glu He Leu Met Glu Pro Ser 
115 120 125 

ttt ttc tta ttt aaa atg eta tac ggg ctg ata tat aaa ata aac att 432 
Phe Phe Leu Phe Lys Met Leu Tyr Gly Leu He Tyr Lys He Asn He 
130 135 140 

aaa aaa aat act gca gtg ttt gtt caa caa ttc tgg atg aaa gaa aaa 480 
Lys Lys Asn Thr Ala Val Phe Val Gin Gin Phe Trp Met Lys Glu Lys 
145 150 155 160 

ttt ate aag aaa tat tct ata aat aac ate att gtc agt egg cca gaa 528 
Phe He Lys Lys Tyr Ser He Asn Asn He He Val Ser Arg Pro Glu 
165 170 175 

att aaa tta tct gat aaa age caa ctt act gat gat gat tct caa ttt 576 
He Lys Leu Ser Asp Lys Ser Gin Leu Thr Asp Asp Asp Ser Gin Phe 
180 185 190 

aag aat aac cct tct gag ttg aca ata ttt tac cct get gtt cca cga 624 
Lys Asn Asn Pro Ser Glu Leu Thr He Phe Tyr Pro Ala Val Pro Arg 
195 200 205 

gta ttt aaa aat tac gag ctt att att agt gca gca agg aaa ttg aaa 672 
Val Phe Lys Asn Tyr Glu Leu He He Ser Ala Ala Arg Lys Leu Lys 
210 215 220 

gaa caa tec aat att aaa ttt ctg ctt act ate agt ggt aca gaa aat 720 
Glu Gin Ser Asn He Lys Phe Leu Leu Thr He Ser Gly Thr Glu Asn 
225 230 235 240 

gcg tat gca aaa tat att ate agt ctt gca gaa gga ctg gat aat gtt 768 
Ala Tyr Ala Lys Tyr He He Ser Leu Ala Glu Gly Leu Asp Asn Val 
245 250 255 

cat ttc etc ggg tac ttg gat aaa gaa aaa ate gat cat tgt tat aat 816 
His Phe Leu Gly Tyr Leu Asp Lys Glu Lys He Asp His Cys Tyr Asn 
260 265 270 

att tea gat ata gtt tgt ttt ccc tct agg tta gaa aca tgg gga ttg 864 
He Ser Asp He Val Cys Phe Pro Ser Arg Leu Glu Thr Trp Gly Leu 
275 280 285 

ccg ttg tct gaa get aaa gag cga ggt aag tgg gta tta gca tea gat 912 
Pro Leu Ser Glu Ala Lys Glu Arg Gly Lys Trp Val Leu Ala Ser Asp 
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290 295 300 

ttc cca ttt act aga gaa act ctt ggt agt tat gaa aag aaa get ttt 960 
Phe Pro Phe Thr Arg Glu Thr Leu Gly Ser Tyr Glu Lys Lys Ala Phe 
305 310 315 320 

ttt gat tct aat aac gat gac atg tta gtt aaa ctt att att gac ttc 1008 
Phe Asp Ser Asn Asn Asp Asp Met Leu Val Lys Leu lie lie Asp Phe 
325 330 335 

aaa aaa ggt aac etc aaa aaa gat ate tct gat gca aat ttc att tat 1056 
Lys Lys Gly Asn Leu Lys Lys Asp He Ser Asp Ala Asn Phe He Tyr 
340 345 350 

cgt aat gaa aat gta tta gtt ggg ttt gat gaa eta gtt aat ttt att 1104 
Arg Asn Glu Asn Val Leu Val Gly Phe Asp Glu Leu Val Asn Phe He 
355 360 365 

act gaa gaa cat tga 1119 
Thr Glu Glu His * 
370 



<210> 175 
<211> 591 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (591) 

<400> 175 

atg ate tta aaa etc get aaa cga tat ggt etc tgt ggt ttt att egg 4B 
Met He Leu Lys Leu Ala Lys Arg Tyr Gly Leu Cys Gly Phe He Arg 
15 10 15 

ctt gtt aga gat gtc tta ttg act cgt gta ttt tac egg aac tgt aga 96 
Leu Val Arg Asp Val Leu Leu Thr Arg Val Phe Tyr Arg Asn Cys Arg 
20 25 30 

att att cga ttt ccc tgc tat att cgc aat gat ggt age att aat ttt 144 
He He Arg Phe Pro Cys Tyr He Arg Asn Asp Gly Ser He Asn Phe 
35 40 45 

ggt gaa aat ttc aca agt gga gtc ggt etc agg ctg gat gca ttt gga 192 
Gly Glu Asn Phe Thr Ser Gly Val Gly Leu Arg Leu Asp Ala Phe Gly 
50 55 60 

cgt ggc gtg att ttt ttt tec gat aat gtg caa gtt aac gac tat gtt 240 
Arg Gly Val He Phe Phe Ser Asp Asn Val Gin Val Asn Asp Tyr Val 
65 70 75 80 

cat ate gec tea att gag age gtt acg ata ggt egg gat acg ctt att 288 
His He Ala Ser He Glu Ser Val Thr He Gly Arg Asp Thr Leu He 
85 90 95 

gca agt aaa gta ttt att ace gat cat aat cac ggt tec ttt aag cac 336 
Ala Ser Lys Val Phe He Thr Asp His Asn His Gly Ser Phe Lys His 
100 . 105 110 
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tct gat cca atg agt teg cca aat ata cct cca gac atg cgc acg ttg 384 
Ser Asp Pro Met Ser Ser Pro Asn lie Pro Pro Asp Met Arg Thr Leu 
115 120 125 

gaa tct tea get gtt gta att ggc cag agg gtt tgg ttg ggt gag aat 432 
Glu Ser Ser Ala Val Val lie Gly Gin Arg Val Trp Leu Gly Glu Asn 
130 135 140 

gtg acg gtt ttg cct gga aca att att ggt aat gga gtc gta gtc ggc 480 
Val Thr Val Leu Pro Gly Thr He lie Gly Asn Gly Val Val Val Gly 
145 150 155 160 

gec aat tct gtt gtt aga ggt tct att ccc gaa aat act gtc att gcg 528 
Ala Asn Ser Val Val Arg Gly Ser He Pro Glu Asn Thr Val He Ala 
165 170 175 

gta cca gca aaa ate ata aag aaa tac aat cat gag acc aaa tta 576 
Gly Val Pro Ala Lys He He Lys Lys Tyr Asn His Glu Thr Lys Leu 
180 185 190 

tgg gaa aaa gca tag 591 
Trp Glu Lys Ala * 
195 



<210> 176 
<211> 993 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (993) 

<400> 176 

atg tat ttt ttg aat gat tta aat ttc tct aga cgc gat get gga ttt 48 
Met Tyr Phe Leu Asn Asp Leu Asn Phe Ser Arg Arg Asp Ala Gly Phe 
1 5 10 15 

aaa gca aga aaa gat gca ctg gac att get tea gat tat gaa aac att 96 
Lys Ala Arg Lys Asp Ala Leu Asp He Ala Ser Asp Tyr Glu Asn He 
20 25 30 

tct gtt gtt aac att cct eta tgg ggt gga gta gtc cag aga att att 144 
Ser Val Val Asn He Pro Leu Trp Gly Gly Val Val Gin Arg He He 
35 . 40 45 

agt tct gtt aag ctt agt aca ttt etc tgc ggt ctt gaa aat aaa gat 192 
Ser Ser Val Lys Leu Ser Thr Phe Leu Cys Gly Leu Glu Asn Lys Asp 
50 55 60 

gtt tta att ttc aat ttc ccg atg gec aaa cca ttt tgg cat ata ttg 240 
Val Leu He Phe Asn Phe Pro Met Ala Lys Pro Phe Trp His He Leu 
65 70 75 80 

tea ttc ttt cac cgc ctt eta aaa ttt aga ata gta cct ctg att cat 288 
Ser Phe Phe His Arg Leu Leu Lys Phe Arg He Val Pro Leu He His 
85 90 95 

gat att gat gaa tta aga gga gga ggg ggt agt gat tct gtg egg ctt 336 
Asp He Asp Glu Leu Arg Gly Gly Gly Gly Ser Asp Ser Val Arg Leu 
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100 105 110 

get acc tgt gat atg gtc ata agt cac aat cca caa atg aca aag tac 384 
Ala Thr Cys Asp Met Val lie Ser His Asn Pro Gin Met Thr Lys Tyr 
115 120 125 

ctt agt aaa tat atg tct cag gat aaa ate aaa gac ata aaa ata ttt 432 
Leu Ser Lys Tyr Met Ser Gin Asp Lys lie Lys Asp lie Lys He Phe 
130 135 140 

gat tac etc gtc tea tct gat gtg gag cat cga gat gtt acg gat aag 480 
Asp Tyr Leu Val Ser Ser Asp Val Glu His Arg Asp Val Thr Asp Lys 
145 150 155 160 

caa cga ggg gtc ata tat get ggc aac ctt tct agg cat aaa tgt tct 528 
Gin Arg Gly Val He Tyr Ala Gly Asn Leu Ser Arg His Lys Cys Ser 
165 170 175 

ttc ata tat act gaa gga tgc gat ttt act etc ttt ggt gtc aac tat 576 
Phe He Tyr Thr Glu Gly Cys Asp Phe Thr Leu Phe Gly Val Asn Tyr 
180 185 190 

gaa aat aaa gat aat cct aaa tat ctt gga agt ttt gat get caa tct 624 
Glu Asn Lys Asp Asn Pro Lys Tyr Leu Gly Ser Phe Asp Ala Gin Ser 
195 200 205 

ccg gaa aag att aac etc cca ggc atg caa ttt gga etc att tgg gat 672 
Pro Glu Lys He Asn Leu Pro Gly Met Gin Phe Gly Leu He Trp Asp 
210 215 | 220 

gga gat tct gtc gaa acc tgt agt ggt gec ttt ggc gac tat tta aag 720 
Gly Asp Ser Val Glu Thr Cys Ser Gly Ala Phe Gly Asp Tyr Leu Lys 
225 230 235 240 

ttt aat aac cct cat aag aca tct ctt tat ctt tea atg gaa ctt cca 768 
Phe Asn Asn Pro His Lys Thr Ser Leu Tyr Leu Ser Met Glu Leu Pro 
245 250 255 

gta ttt ata tgg gat aaa gec gee ctt gcg gat ttc att gta gat aat 816 
Val Phe He Trp Asp Lys Ala Ala Leu Ala Asp Phe He Val Asp Asn 
260 265 270 

aga ata gga tat gca gtg gga tea ate aaa gaa atg caa gag att gtt 864 
Arg He Gly Tyr Ala Val Gly Ser He Lys Glu Met Gin Glu He Val 
275 280 285 

gac tec atg aca ata gaa act tat aag caa att agt gag aat aca aaa 912 
Asp Ser Met Thr He Glu Thr Tyr Lys Gin He Ser Glu Asn Thr Lys 
290 295 300 



att att tct cag aaa att cga aca gga agt tac ttc agg gat gtt ctt 
He He Ser Gin Lys He Arg Thr Gly Ser Tyr Phe Arg Asp Val Leu 
305 310 315 320 



960 



gaa gag gtg ate gat gat ctt aaa act cgc taa 993 
Glu Glu Val He Asp Asp Leu Lys Thr Arg * 
325 330 



<210> 177 
<211> 1167 
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<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1167) 

<400> 177 

atg ate tat ctt gta att agt gtc ttt etc att aca gca ttt ate tgt 48 
Met lie Tyr Leu Val lie Ser Val Phe Leu lie Thr Ala Phe lie Cys 
15 10 15 

tta tat ctt aag aag gat at a ttt tat cca gec gta tgc gtt aat ate 96 
Leu Tyr Leu Lys Lys Asp lie Phe Tyr Pro Ala Val Cys Val Asn He 
20 25 30 

ate ttc gca ctg gtc tta ttg gga tat gaa ata acg tea gat ata tat 144 
He Phe Ala Leu Val Leu Leu Gly Tyr Glu He Thr Ser Asp He Tyr 
35 40 45 

get ttt cag tta aat gac get acg ttg att ttt eta ctt tgc aat gtt 192 
Ala Phe Gin Leu Asn Asp Ala Thr Leu He Phe Leu Leu Cys Asn Val 
50 - 55 60 

ttg aca ttt ace ctg tea tgt tta ttg acg gaa agt gta tta gat eta 240 
Leu Thr Phe Thr Leu Ser Cys Leu Leu Thr Glu Ser Val Leu Asp Leu 
65 70 75 , 80 

aat ate aga aaa gtc aat aat get att tat age ata cca teg aag aaa 288 
Asn He Arg Lys Val Asn Asn Ala He Tyr Ser He Pro Ser Lys Lys 
85 90 95 

gtg cat aat gta ggc ttg tta gtt att tct ttt teg atg ata tat ata 336 
Val His Asn Val Gly Leu Leu Val He Ser Phe Ser Met He Tyr He 
100 105 110 

tgc atg agg tta agt aac tac cag ttc ggg act age tta ctt age tat 384 
Cys Met Arg Leu Ser Asn Tyr Gin Phe Gly Thr Ser Leu Leu Ser Tyr 
115 120 125 

atg aat ttg ata aga gat get gat gtt gaa gac aca tea aga aat ttc 432 
Met Asn Leu He Arg Asp Ala Asp Val Glu Asp Thr Ser Arg Asn Phe 
130 135 140 

tea gca tac atg cag cca ate att eta act act ttt get tta ttt att 480 
Ser Ala Tyr Met Gin Pro He lie Leu Thr Thr Phe Ala Leu Phe He 
145 150 155 160 

tgg tct aaa aaa ttt act aat aca aag gta agt aaa aca ttt act tta 528 
Trp Ser Lys Lys Phe Thr Asn Thr Lys Val Ser Lys Thr Phe Thr Leu 
165 170 175 

ctt gtt ttt att gta ttc ate ttt gca att ata ctg aat act ggt aag 576 
Leu Val Phe He Val Phe He Phe Ala He He Leu Asn Thr Gly Lys 
180 185 190 

caa att gtc ttt atg gtt ate ate tct tat gca ttc ate gta ggt gtt 624 
Gin He Val Phe Met Val He He Ser Tyr Ala Phe He Val Gly Val 
195 200 205 

aat aga gta aaa cat tat gtt tat ctt att aca get gta ggt gtt eta 672 
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Asn Arg Val Lys His Tyr Val Tyr Leu lie Thr Ala Val Gly Val Leu 
210 215 220 

ttc tec ttg tat atg etc ttt tta cgt gga ctg cct ggg ggg atg gca 720 
Phe Ser Leu Tyr Met Leu Phe Leu Arg Gly Leu Pro Gly Gly Met Ala 
225 230 235 240 

tat tat eta tec atg tat ttg gtc age cct ata ate gcg ttt cag gag 768 
Tyr Tyr Leu Ser Met Tyr Leu Val Ser Pro lie He Ala Phe Glri Glu 
245 250 255 

ttt tat ttt cag caa gta tct aac tct gee agt tct cat gtc ttt tgg 816 
Phe Tyr Phe Gin Gin Val Ser Asn Ser Ala Ser Ser His Val Phe Trp 
260 265 270 

ttt ttt gaa agg ctg atg ggg eta tta aca ggt gga gtc tct atg teg 864 
Phe Phe Glu Arg Leu Met Gly Leu Leu Thr Gly Gly Val Ser Met Ser 
275 280 285 

ttg cat aaa gaa ttt gtg tgg gtg ggt ttg cca aca aat gtt tat act 912 
Leu His Lys Glu Phe Val Trp Val Gly Leu Pro Thr Asn Val Tyr Thr 
290 295 300 

get ttt teg gat tat gtt tat att tec gcg gag eta age tat ttg atg 960 
Ala Phe Ser Asp Tyr Val Tyr lie Ser Ala Glu Leu Ser Tyr Leu Met 
305 310 315 320 

atg gtt att cat ggc tgt att tea ggt gtt tta tgg aga ttg tct cga 1008 
Met Val He His Gly Cys lie Ser Gly Val Leu Trp Arg Leu Ser Arg 
325 330 335 

aat tac ata tct gtg aaa ata ttt tat tea tat ttt att tat ace ttt 1056 
Asn Tyr He Ser Val Lys He Phe Tyr Ser Tyr Phe He Tyr Thr Phe 
340 345 350 

tct ttc att ttt tat cat gaa age ttc atg act aat att age agt tgg 1104 
Ser Phe He Phe Tyr His Glu Ser Phe Met Thr Asn He Ser Ser Trp 
355 360 365 

ata caa ata act ctt tgt ate ata gta ttc tct caa ttt ctt aag gee 1152 
He Gin He Thr Leu Cys He He Val Phe Ser Gin Phe Leu Lys Ala 
370 375 380 

cag aaa ata aag tga 1167 

Gin Lys He Lys * 

385 



<210> 178 
<211> 1104 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1104) 

<400> 178 

atg tac gat tat ate att gtt ggt tct ggt ttg ttt ggt gee gtt tgt 48 
Met Tyr Asp Tyr He He Val Gly Ser Gly Leu Phe Gly Ala Val Cys 
15 10 15 
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gcg aat gag tta aaa aag eta aac aaa aaa gtt tta gtg att gag aaa 96 
Ala Asn Glu Leu Lys Lys Leu Asn Lys Lys Val Leu Val lie Glu Lys 
20 25 30 

aga aat cat ate ggt gga aat gcg tac aca gag gac tgt gag ggt ate 144 
Arg Asn His lie Gly Gly Asn Ala Tyr Thr Glu Asp Cys Glu Gly lie 
35 40 45 

cag att cat aaa tat ggt gca cat att ttt cat acc aat gat aaa tat 192 
Gin lie His Lys Tyr Gly Ala His He Phe His Thr Asn Asp Lys Tyr 
50 55 60 

at a tgg gat tac gtt aat gat tta gta gaa ttt aat cgt ttt act aat 240 
He Trp Asp Tyr Val Asn Asp Leu Val Glu Phe Asn Arg Phe Thr Asn 
65 70 75 80 

tct cca ctg gcg att tat aaa gac aaa tta ttc aac ctt cct ttt aat 288 
Ser Pro Leu Ala He Tyr Lys Asp Lys Leu Phe Asn Leu Pro Phe Asn 
85 90 95 

atg aat act ttc cac caa atg tgg gga gtt aaa gat cct caa gaa get 336 
Met Asn Thr Phe His Gin Met Trp Gly Val Lys Asp Pro Gin Glu Ala 
100 105 110 

caa aat ate att aat get cag aaa aaa aag tat ggt gac aag gta cct 384 
Gin Asn He He Asn Ala Gin Lys Lys Lys Tyr Gly Asp Lys Val Pro 
115 120 125 

gaa aat ttg gag gag cag gcg att tea tta gtt ggg gag gac tta tac 432 
Glu Asn Leu Glu Glu Gin Ala He Ser Leu Val Gly Glu Asp Leu Tyr 
130 135 140 

caa gca ttg ata aag ggt tat acg gag aag cag tgg gga aga agt gca 480 
Gin Ala Leu He Lys Gly Tyr Thr Glu Lys Gin Trp Gly Arg Ser Ala 
145 150 155 160 

aaa gaa ttg cct gca ttt att att aag cga ate cca gtg aga ttt acg 528 
Lys Glu Leu Pro Ala Phe He He Lys Arg He Pro Val Arg Phe Thr 
165 170 175 

ttt gat aac aat tat ttt tec gat cgc tat caa ggt att ccg gtg gga 576 
Phe Asp Asn Asn Tyr Phe Ser Asp Arg Tyr Gin Gly He Pro Val Gly 
180 185 190 

ggc tac act aag ctt att gaa aaa atg ctt gaa ggt gtg gac gta aaa 624 
Gly Tyr Thr Lys Leu He Glu Lys Met Leu Glu Gly Val Asp Val Lys 
195 200 205 

tta ggc att gat ttt ttg aaa gac aaa gat tct eta gcg agt aaa gec 672 
Leu Gly He Asp Phe Leu Lys Asp Lys Asp Ser Leu Ala Ser Lys Ala 
210 215 220 

cat aga ate ate tac act gga ccc att gat cag tac ttc gac tat agg 720 
His Arg He He Tyr Thr Gly Pro He Asp Gin Tyr Phe Asp Tyr Arg 
225 230 235 240 

ttt gga gcg tta gaa tat cgc tct tta aaa ttt gag acg gaa cgc cat 768 
Phe Gly Ala Leu Glu Tyr Arg Ser Leu Lys Phe Glu Thr Glu Arg His 
245 250 255 



-134- 



WO 01/48209 



PCT/DS00/34419 



gaa ttt cca aac ttc caa ggg aat gca gta ata aat ttc act gat get 816 
Glu Phe Pro Asn Phe Gin Gly Asn Ala Val lie Abii Phe Thr Asp Ala 
260 265 270 

aat gta cca tat acc aga ata att gag cat aaa cat ttt gac tat gtt 864 
Asn Val Pro Tyr Thr Arg lie lie Glu His Lys His Phe Asp Tyr Val 
275 280 285 

gag aca aag cat acg gtt gtt aca aaa gaa tat cca tta gag tgg aaa 912 
Glu Thr Lys His Thr Val Val Thr Lys Glu Tyr Pro Leu Glu Trp Lys 
290 295 300 

gtt ggc gac gaa ccc tac tat cca gtt aat gat aat aaa aac atg gag 960 
Val Gly Asp Glu Pro Tyr Tyr Pro Val Asn Asp Asn Lys Asn Met Glu 
305 310 315 320 

ctt ttt aag aaa tat aga gag tta get age aga gaa gac aag gtt ata 1008 
Leu Phe Lys Lys Tyr Arg Glu Leu Ala Ser Arg Glu Asp Lys Val lie 
325 330 335 

ttt ggc ggg cgt ttg gec gag tat aaa tat tat gat atg cat caa gtg 1056 
Phe Gly Gly Arg Leu Ala Glu Tyr Lys Tyr Tyr Asp Met His Gin Val. 
340 345 350 

ata tct gee get ctt tat caa gtg aaa aat ata atg agt acg gat taa 1104 
lie Ser Ala Ala Leu Tyr Gin Val Lys Asn lie Met Ser Thr Asp * 
355 360 365 



<210> 179 
<211> 693 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (693) 

<400> 179 

ttg aca tea tea ata acc aat gaa att atg caa tta tat acg gat agg 48 
Met Thr Ser Ser lie Thr Asn Glu lie Met Gin Leu Tyr Thr Asp Arg 
1 5 10 15 

gag gtt ctt aac atg ggg ctt tgt agt cgt tat aaa agt ctt aca tgc 96 
Glu Val Leu Asn Met Gly Leu Cys Ser Arg Tyr Lys Ser Leu Thr Cys 
20 25 30 

aat agt tgc tea atg cat tgc caa ata atg cca gaa gag tea ccg cgt 144 
Asn Ser Cys Ser Met His Cys Gin lie Met Pro Glu Glu Ser Pro Arg 
35 40 45 

ttg cag tat tgc gca aac teg tgt ttt tgt atg tgg ccc gaa gaa agt 192 
Leu Gin Tyr Cys Ala Asn Ser Cys Phe Cys Met Trp Pro Glu Glu Ser 
50 55 60 

tea tat ttt aat cgt ggt gtg gta gaa ggt ate tta aca aaa aac cac 240 
Ser Tyr Phe Asn Arg Gly Val Val Glu Gly lie Leu Thr Lys Asn His 
65 70 75 80 

aat gcg aga tta age gga tat att ttt gtc gat ttt tea gta agt ttt 288 
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Asn Ala Arg Leu Ser Gly Tyr lie Phe Val Asp Phe Ser Val Ser Phe 
85 90 95 

tta cgt eta ttc ctt gaa aaa gac tgg att gac tat ctt gca agt act 336 
Leu Arg Leu Phe Leu Glu Lys Asp Trp lie Asp Tyr Leu Ala Ser Thr 
100 105 110 

gac atg ggg att gtt tta gtc age gac cgc aat atg caa teg tta get 384 
Asp Met Gly lie Val Leu Val Ser Asp Arg Asn Met Gin Ser Leu Ala 
115 120 125 

aat tac tgg cga aaa cac aac tea gca ata tct get gtc ata tat aat 432 
Asn Tyr Trp Arg Lys His Asn Ser Ala lie Ser Ala Val lie Tyr Asn 
130 135 140 

gat gat ggt ctt gat gtt gee aat gaa aag ate aga caa eta ttt att 480 
Asp Asp Gly Leu Asp Val Ala Asn Glu Lys lie Arg Gin Leu Phe He 
145 150 155 160 

ggt cgt tat tta tea ttt acc gga ggt aac aca tta act cag atg gaa 528 
Gly Arg Tyr Leu Ser Phe Thr Gly Gly Asn Thr Leu Thr Gin Met Glu 
165 170 175 

ttc acc ata atg ggg tat atg gtt tct ggt tat aat cca tat cag att 576 
Phe Thr He Met Gly Tyr Met Val Ser Gly Tyr Asn Pro Tyr Gin He 
180 185 190 

get gaa gtt ctg gat atg gat ate cgt age ate tac gcg tac aag caa 624 
Ala Glu Val Leu Asp Met Asp He Arg Ser He Tyr Ala Tyr Lys Gin 
195 " 200 205 

cga ate gaa aag aga atg ggt ggt aaa ata aac gaa tta ttt att cgt 672 
Arg He Glu Lys Arg Met Gly Gly Lys He Asn Glu Leu Phe He Arg 
210 215 220 

tea cat teg gtc caa cat tga 693 
Ser His Ser Val Gin His * 
225 230 



<210> 180 
<211> 1176 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1176) 

<400> 180 

atg caa aaa etc att aac tea gtg caa aac tat gee tgg ggc age aaa 48 
Met Gin Lys Leu He Asn Ser Val Gin Asn Tyr Ala Trp Gly Ser Lys 
15 10 " 15 

acg gcg ttg act gaa ctt tat ggt atg gaa aat ccg tec age cag ccg 96 
Thr Ala Leu Thr Glu Leu Tyr Gly Met Glu Asn Pro Ser Ser Gin Pro 
20 25 30 

atg gee gag ctg tgg atg ggc gca cat ccg aaa age agt tea cga gtg 144 
Met Ala Glu Leu Trp Met Gly Ala His Pro Lys Ser Ser Ser Arg Val 
35 40 45 
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cag aat gcc gcc gga gat ate gtt tea ctg cgt gat gtg att gag agt 192 
Gin Asn Ala Ala Gly Asp lie Val Ser Leu Arg Asp Val lie Glu Ser 
50 55 60 

gat aaa teg act ctg etc gga gag gcc gtt gcc aaa cgc ttt ggc gaa 240 
Asp Lys Ser Thr Leu Leu Gly Glu Ala Val Ala Lys Arg Phe Gly Glu 
65 70 75 80 

ctg cct ttc ctg ttc aaa gta tta tgc gca gca cag cca etc tec att 288 
Leu Pro Phe Leu Phe Lys Val Leu Cys Ala Ala Gin Pro Leu Ser He 
85 90 95 

cag gtt cat cca aac aaa cac aat tct gaa ate ggt ttt gcc aaa gaa 336 
Gin Val His Pro Asn Lys His Asn Ser Glu He Gly Phe Ala Lys Glu 
100 105 110 

aat gcc gca ggt ate ccg atg gat gcc gcc gag cgt aac tat aaa gat 384 
Asn Ala Ala Gly He Pro Met Asp Ala Ala Glu Arg Asn Tyr Lys Asp 
115 120 125 

cct aac cac aag ccg gag ctg gtt ttt gcg ctg acg cct ttc ctt gcg 432 
Pro Asn His Lys Pro Glu Leu Val Phe Ala Leu Thr Pro Phe Leu Ala 
130 135 140 

atg aac gcg ttt cgt gaa ttt tec gag att gtc tec eta etc cag ccg 480 
Met Asn Ala Phe Arg Glu Phe Ser Glu He Val Ser Leu Leu Gin Pro 
145 150 155 160 

gtc gca ggt gca cat ccg gcg att get cac ttt tta caa cag cct gat 528 
Val Ala Gly Ala His Pro Ala He Ala His Phe Leu Gin Gin Pro Asp 
165 170 175 

gcc gaa cgt tta age gaa ctg ttc gcc age ctg ttg aat atg cag ggt 576 
Ala Glu Arg Leu Ser Glu Leu Phe Ala Ser Leu Leu Asn Met Gin Gly 
180 185 190 

gaa gaa aaa tec cgc gcg ctg gcg att tta aaa teg gcc etc gat age 624 
Glu Glu Lys Ser Arg Ala Leu Ala He Leu Lys Ser Ala Leu Asp Ser 
195 200 205 

cag cag ggt gaa ccg tgg caa acg att cgt tta att tct gaa ttt tac 672 
Gin Gin Gly Glu Pro Trp Gin Thr He Arg Leu He Ser Glu Phe Tyr 
210 215 220 

ccg gaa gac age ggt ctg ttc tec ccg eta ttg ctg aat gtg gtg aaa 720 
Pro Glu Asp Ser Gly Leu Phe Ser Pro Leu Leu Leu Asn Val Val Lys 
225 230 235 240 

ttg aac cct ggc gaa gcg atg ttc ctg ttc get gaa aca ccg cac get 768 
Leu Asn Pro Gly Glu Ala Met Phe Leu Phe Ala Glu Thr Pro His Ala 
245 250 255 

tac ctg caa ggc gtg gcg ctg gaa gtg atg gca aac tec gat aac gtg 816 
Tyr Leu Gin Gly Val Ala Leu Glu Val Met Ala Asn Ser Asp Asn Val 
260 265 270 

ctg cgt gcg ggt ctg acg cct aaa tac att gat att ccg gaa ctg gtt 864 
Leu Arg Ala Gly Leu Thr Pro Lys Tyr He Asp He Pro Glu Leu Val 
275 280 285 
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gcc aat gtg aaa ttc gaa gcc aaa ccg get aac cag ttg ttg acc cag 912 
Ala Asn Val Lys Phe Glu Ala Lys Pro Ala Asn Gin Leu Leu Thr Gin 
290 295 300 

ccg gtg aaa caa ggt gca gaa ctg gac ttc ccg att cca gtg gat gat 960 
Pro Val Lys Gin Gly Ala Glu Leu Asp Phe Pro He Pro Val Asp Asp 
305 310 315 320 

ttt gcc ttc teg ctg cat gac ctt agt gat aaa gaa acc acc att age 1008 
Phe Ala Phe Ser Leu His Asp Leu Ser Asp Lys Glu Thr Thr He Ser 
325 330 335 

cag cag agt gcc gcc att ttg ttc tgc gtc gaa ggc gat gca acg ttg 1056 
Gin Gin Ser Ala Ala He Leu Phe Cys Val Glu Gly Asp Ala Thr Leu 
340 345 350 

tgg aaa ggt tct cag cag tta cag ctt aaa ccg ggt gaa tea gcg ttt 1104 
Trp Lys Gly Ser Gin Gin Leu Gin Leu Lys Pro Gly Glu Ser Ala Phe 
355 360 365 

att gcc gcc aac gaa tea ccg gtg act gtc aaa ggc cac ggc cgt tta 1152 
He Ala Ala Asn Glu Ser Pro Val Thr Val Lys Gly His Gly Arg Leu 
370 375 380 



gcg cgt gtt tac aac aag ctg taa 1176 
Ala Arg Val Tyr Asn Lys Leu * 
385 390 



<210> 181 

<211> 795 

<212> DNA 

<213> Escherichia coli . 

<220> 

<221> CDS 

<222> (1) . . . (795) 

<400> 181 

atg aaa cag tat tta gaa ctg atg caa aaa gtg etc gac gaa ggc aca 48 
Met Lys Gin Tyr Leu Glu Leu Met Gin Lys Val Leu Asp Glu Gly Thr 
15 10 15 

cag aaa aac gac cgt acc gga acc gga acg ctt tec att ttt ggt cat 96 
Gin Lys Asn Asp Arg Thr Gly Thr Gly Thr Leu Ser He Phe Gly His 
20 25 30 

cag atg cgt ttt aac ctg caa gat gga ttc ccg ctg gtg aca act aaa 144 
Gin Met Arg Phe Asn Leu Gin Asp Gly Phe Pro Leu Val Thr Thr Lys 
35 40 45 

cgt tgc cac ctg cgt tec ate ate cat gaa ctg ctg tgg ttt ctg cag 192 
Arg Cys His Leu Arg Ser He He His Glu Leu Leu Trp Phe Leu Gin 
50 55 60 

ggc gac act aac att get tat eta cac gaa aac aat gtc acc ate tgg 240 
Gly Asp Thr Asn He Ala Tyr Leu His Glu Asn Asn Val Thr He Trp 
65 70 75 80 

gac gaa tgg gcc gat gaa aac ggc gac etc ggg cca gtg tat ggt aaa 288 
Asp Glu Trp Ala Asp Glu Asn Gly Asp Leu Gly Pro Val Tyr Gly Lys 
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85 90 95 

cag tgg cgc gcc tgg cca acg cca gat ggt cgt cat att gac cag ate 336 
Gin Trp Arg Ala Trp Pro .Thr Pro Asp Gly Arg His He J^sp Gin He 
100 105 110 

act acg gta ctg aac cag ctg aaa aac gac ccg gat teg cgc cgc att 384 
Thr Thr Val Leu Asn Gin Leu Lys Asn Asp Pro Asp Ser Arg Arg He 
115 120 125 

att gtt tea gcg tgg aac gta ggc gaa ctg gat aaa atg gcg ctg gca 432 
He Val Ser Ala Trp Asn Val Gly Glu Leu Asp Lys Met Ala Leu Ala 
130 135 140 

ccg tgc cat gca ttc ttc cag ttc tat gtg gca gac ggc aaa etc tct 480 
Pro Cys His Ala Phe Phe Gin Phe Tyr Val Ala Asp Gly Lys Leu Ser 
145 150 155 160 

tgc cag ctt tat cag cgc tec tgt gac gtc ttc etc ggc ctg ccg ttc 528 
Cys Gin Leu Tyr Gin Arg Ser Cys Asp Val Phe Leu Gly Leu Pro Phe 
165 170 175 

aac att gcc age tac gcg tta ttg gtg cat atg atg gcg cag cag tgc 576 
Asn He Ala Ser Tyr Ala Leu Leu Val His Met Met Ala Gin Gin Cys 
180 185 190 

gat ctg gaa gtg ggt gat ttt gtc tgg ace ggt ggc gac acg cat ctg 624 
Asp Leu Glu Val Gly Asp Phe Val Trp Thr Gly Gly Asp Thr His Leu 
195 200 205 

tac age aac cat atg gat caa act cat ctg caa tta age cgc gaa ccg 672 
Tyr Ser Asn His Met Asp Gin Thr His Leu Gin Leu Ser Arg Glu Pro 
210 215 220 

cgt ccg ctg ccg aag ttg att ate aaa cgt aaa ccc gaa tec ate ttc 720 
Arg Pro Leu Pro Lys Leu He He Lys Arg Lys Pro Glu Ser He Phe 
225 230 235 240 

gac tac cgt ttc gaa gac ttt gag att gaa ggc tac gat ccg cat ccg 768 
Asp Tyr Arg Phe Glu Asp Phe Glu He Glu Gly Tyr Asp Pro His Pro 
245 250 255 

ggc att aaa gcg ccg gtg get ate taa 795 
Gly He Lys Ala Pro Val Ala He * 
260 



<210> 182 
<211> 876 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (876) 

<400> 182 

atg acc agt age tat ctg cat ttt ccg gag ttt gat ccg gtc att ttc 48 
Met Thr Ser Ser Tyr Leu His Phe Pro Glu Phe Asp Pro Val He Phe 
1 5 10 15 
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tea ata gga ccc gtg gcg ctt cac tgg tac ggc ctg atg tat ctg gtg 96 
Ser lie Gly Pro Val Ala Leu His Trp Tyr Gly Leu Met Tyr Leu Val 
20 25 30 

ggt ttc att ttt gca atg tgg ctg gca aca cga egg gcg aat cgt ccg 144 
Gly Phe He Phe Ala Met Trp Leu Ala Thr Arg Arg Ala Asn Arg Pro 
35 40 45 

ggc age ggc tgg ace aaa aat gaa gtt gaa aac tta etc tat gcg ggc 192 
Gly Ser Gly Trp Thr Lys Asn Glu Val Glu Asn Leu Leu Tyr Ala Gly 
50 55 60 

ttc etc ggc gtc ttc etc ggg gga cgt att ggt tat gtt ctg ttc tac 240 
Phe Leu Gly Val Phe Leu Gly Gly Arg He Gly Tyr Val Leu Phe Tyr 
65 70 75 80 

aat ttc ccg cag ttt atg gec gat ccg ctg tat ctg ttc cgt gtc tgg 288 
Asn Phe Pro Gin Phe Met Ala Asp Pro Leu Tyr Leu Phe Arg Val Trp 
85 90 95 

gac ggc ggc atg tct ttc cac ggc ggc ctg att ggc gtt ate gtg gtg 336 
Asp Gly Gly Met Ser Phe His Gly Gly Leu He Gly Val He Val Val 
100 105 110 

atg att ate ttc gee cgc cgt act aaa cgt tec ttc ttc cag gtc tct 384 
Met He He Phe Ala Arg Arg Thr Lys Arg Ser Phe Phe Gin Val Ser 
115 120 125 

gat ttt ate gca cca etc att ccg ttt ggt ctt ggt gee ggg cgt ctg 432 
Asp Phe He Ala Pro Leu He Pro Phe Gly Leu Gly Ala Gly Arg Leu 
130 135 140 

ggc aac ttt att aac ggt gaa ttg tgg ggc cgc gtt gac ccg aac ttc 480 
Gly Asn Phe He Asn Gly Glu Leu Trp Gly Arg Val Asp Pro Asn Phe 
145 150 155 160 

ccg ttt gec atg ctg ttc cct ggc tec cgt aca gaa gat att ttg ctg 528 
Pro Phe Ala Met Leu Phe Pro Gly Ser Arg Thr Glu Asp He Leu Leu 
165 170 175 

ctg caa ace aac ccg cag tgg caa tec att ttc gac act tac ggt gtg 576 
Leu Gin Thr Asn Pro Gin Trp Gin Ser He Phe Asp- Thr Tyr Gly Val 
180 185 190 

ctg ccg cgc cac cca tea cag ctt tac gag ctg ctg ctg gaa ggt gtg 624 
Leu Pro Arg His Pro Ser Gin Leu Tyr Glu Leu Leu Leu Glu Gly Val 
195 200 205 

gtg ctg ttt att ate etc aac ctg tat att cgt aaa cca cgc cca atg 672 
Val Leu Phe He He Leu Asn Leu Tyr He Arg Lys Pro Arg Pro Met 
210 215 220 

gga get gtc tea ggt ttg ttc ctg att ggt tac ggc gcg ttt cgc ate 720 
Gly Ala Val Ser Gly Leu Phe Leu He Gly Tyr Gly Ala Phe Arg He 
225 230 235 240 

att gtt gag ttt ttc cgc cag ccc gac gcg cag ttt acc ggt gec tgg 768 
He Val Glu Phe Phe Arg Gin Pro Asp Ala Gin Phe Thr Gly Ala Trp 
245 250 255 

gtg cag tac ate age atg ggg caa att ctt tec ate ccg atg att gtc 816 
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Val Gin Tyr lie Ser Met Gly Gin lie Leu Ser lie Pro Met lie Val 
260 265 270 

gcg ggt gtg ate atg atg gtc tgg gca tat cgt cgc age cca cag caa 864 
Ala Gly Val lie Met Met Val Trp Ala Tyr Arg Arg Ser Pro Gin Gin 
275 280 285 



cac gtt tec tga 876 
His Val Ser + 
290 



<210> 183 
<211> 726 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (726) 

<400> 183 

atg gac agt etc aat ctt aat aaa cat att tec ggc cag ttc aac gec 48 

Met Asp Ser Leu Asn Leu Asn Lys His lie Ser Gly Gin Phe Asn Ala 
• 1 5 10 15 

gaa ctg gaa agt ate cgc acg cag gtg atg acc atg ggc ggc atg gtg 96 
Glu Leu Glu .Ser lie Arg Thr Gin Val Met Thr Met Gly Gly Met Val 
20 25 30 

gag cag cag ctt tct gat gca ate acc gcg atg cat aac cag gac age 144 
Glu Gin Gin Leu Ser Asp Ala lie Thr Ala Met His Asn Gin Asp Ser 
35 40 45 

gat ctg gcg aag cgc gtc ate gaa ggc gac aag aac gtc aac atg atg 192 
Asp Leu Ala Lys Arg Val lie Glu Gly Asp Lys Asn Val Asn Met Met 
50 55 60 

gaa gtg gcg ate gat gaa gee tgc gtg cgc att ate gee aaa cgt cag .240 
Glu Val Ala He Asp Glu Ala Cys Val Arg He He Ala Lys Arg Gin 
65 70 75 80 

ccg acg gcg age gac ctg cga ctg gtt atg gtg ate agt aaa acc att 288 
Pro Thr Ala Ser Asp Leu Arg Leu Val Met Val He Ser Lys Thr He 
85 90 95 

gec gag ctg gag cgt att ggc gac gtg gcg gac aaa ate tgc cgt act 336 
Ala Glu Leu Glu Arg He Gly Asp Val Ala Asp Lys He Cys Arg Thr 
100 105 110 

gcg ctg gag aaa ttc tec cag cag cat cag ccg ttg ctg gta agt ctg 384 
Ala Leu Glu Lys Phe Ser Gin Gin His Gin Pro Leu Leu Val Ser Leu 
115 120 125 

gag .teg ctg ggc cgt cat acc ate cag atg ctg cac gac gtg ctg gac 432 
Glu Ser Leu Gly Arg His Thr He Gin Met Leu His Asp Val Leu Asp 
130 135 140 

gcg ttc gcg egg atg gac att gac gaa gcg gta cgt att tat cgt gaa 480 
Ala Phe Ala Arg Met Asp He Asp Glu Ala Val Arg He Tyr Arg Glu 
145 150 155 160 
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gat aaa aaa gtc gat cag gaa tac gaa ggt att gtt cgt caa ctg atg 528 
Asp Lys Lys Val Asp Gin Glu Tyr Glu Gly lie Val Arg Gin Leu Met 
165 170 175 

acc tac atg atg gaa gat teg cgt acc att ccg age gta ctt act gcg 576 
Thr Tyr Met Met Glu Asp Ser Arg Thr lie Pro Ser Val Leu Thr Ala 
180 185 190 

ctg ttc tgc gcg cgt tct ate gaa cgt att ggc gac cgc tgc cag aat 624 
Leu Phe Cys Ala Arg Ser lie Glu Arg He Gly Asp Arg Cys Gin Asn 
195 200 205 

att tgt gag ttt ate ttc tac tac gtg aag ggg cag gat ttc cgt cac 672 
He Cys Glu Phe He Phe Tyr Tyr Val Lys Gly Gin Asp Phe Arg His 
210 215 220 

gtc ggt ggc gat gag ctg gat aaa ctg ctg gcg ggg aaa gat age gac 720 
Val Gly Gly Asp Glu Leu Asp Lys Leu Leu Ala Gly Lys Asp Ser Asp 
225 230 235 240 

aaa taa 726 
Lys * 



<210> 184 
<211> 774 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (774) 

<400> 184 

atg agt atg gtt gaa act gee ccg agt aaa att cag gtt cgt aat ttg 48 

Met Ser Met Val Glu Thr Ala Pro Ser Lys He Gin Val' Arg Asa Leu 
15 10 15 

aac ttc tac tac ggc aaa ttc cat gee ctg aaa aac ate aac ctg gat 96 
Asn Phe Tyr Tyr Gly Lys Phe His Ala Leu Lys Asn He Asn Leu Asp 
20 25 30 

ate get aaa aac cag gta acg gcg ttt ate ggg ccg tec ggc tgc ggt 144 
He Ala Lys Asn Gin Val Thr Ala Phe He Gly Pro Ser Gly Cys Gly 
35 40 45 

aaa teg acg ctg ctg cgt acc ttc aac aaa atg ttt gaa ctg tac ccg 192 
Lys Ser Thr Leu Leu Arg Thr Phe Asn Lys Met Phe Glu Leu Tyr Pro 
50 55 60 

gag cag cgt gcg gaa ggt gaa att ctg ctt gat ggc gac aac ate ctg 240 
Glu Gin Arg Ala Glu Gly Glu He Leu Leu Asp Gly Asp Asn -lie Leu 
65 70 75 80 

acc aac tct cag gat ate gca ctg ctg cgt gcg aaa gtg ggc atg gtg 288 
Thr Asn Ser Gin Asp He Ala Leu Leu Arg Ala Lys Val Gly Met Val 
85 90 95 

ttc cag aaa ccg acg ccg ttt ccg atg tec ate tac gac aac ate get 336 
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Phe Gin Lys Pro Thr Pro Phe Pro Met Ser lie Tyr Asp Asn lie Ala 
100 105 110 

ttt ggc gtt cgt ctg ttt gag aag etc tec cgt gec gac atg gac gag 384 
Phe Gly Val Arg Leu Phe Glu Lys Leu Ser Arg Ala Asp Met Asp Glu 
115 120 125 

cgc gtg cag tgg gca ttg acc aaa gec gca ttg tgg aac gaa acc aaa 432 
Arg Val Gin Trp Ala Leu Thr Lys Ala Ala Leu Trp Asn Glu Thr Lys 
130 135 140 

gat aaa ttg cac cag age ggt tac tct etc tct ggt ggt cag caa cag 480 
Asp Lys Leu His Gin Ser Gly Tyr Ser Leu Ser Gly Gly Gin Gin Gin 
145 150 155 160 

cgt ctg tgt att gcg cgt ggt ate gec att cgc ccg gaa gtg ctg ctg 528 
Arg Leu Cys lie Ala Arg Gly lie Ala lie Arg Pro Glu Val Leu Leu 
165 170 175 

etc gac gaa ccg tgt teg gcg etc gac cct ate tct acc ggg cgt att 576 
Leu Asp Glu Pro Cys Ser Ala Leu Asp Pro lie Ser Thr Gly Arg lie 
180 185 190 

gaa gag ctg ate acc gaa ctg aag cag gat tac acc gtg gtg ate gtc 624 
Glu Glu Leu lie Thr Glu Leu Lys Gin Asp Tyr Thr Val Val lie Val 
195 200 205 

acc cac aac atg cag cag get gcg cgt tgt tec gac cac acg gcg ttt 672 
Thr His Asn Met Gin Gin Ala Ala Arg Cys Ser Asp His Thr Ala Phe 
210 215 220 

atg tac ctg ggc gaa ttg att gag ttc age aac acg gac gat ctg ttc 720 
Met Tyr Leu Gly Glu Leu lie Glu Phe Ser Asn Thr Asp Asp Leu Phe 
225 230 235 240 

acc aag cca gcg aag aaa caa aca gaa gac tac ate acc ggt cgt tac 768 
Thr Lys Pro Ala Lys Lys Gin Thr Glu Asp Tyr lie Thr Gly Arg Tyr 
245 250 255 

ggt tga 774 
Gly * 



<210> 185 
<211> 891 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(891) 

<400> 185 

atg get atg gtt gaa atg caa acc act gcg gcg ctg get gaa tct cgc 48 
Met Ala Met Val Glu Met Gin Thr Thr Ala Ala Leu Ala Glu Ser Arg 
15 10 15 

cgc aaa atg cag gcg cgt cgc cgc etc aaa aac cgt att gcg ctg acg 96 
Arg Lys Met Gin Ala Arg- Arg Arg Leu Lys Asn Arg lie Ala Leu Thr 
20 25 30 
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etc teg atg gcg acg atg gee ttc ggc ctg ttc tgg ctg ate tgg att 144 
Leu Ser Met Ala Thr Met Ala Phe Gly Leu Phe Trp Leu He Trp He 
35 40 45 

tta atg tec ace ate act cgc ggt ate gac ggt atg teg ctg gcg ctg 192 
Leu Met Ser Thr He Thr Arg Gly He Asp Gly Met Ser Leu Ala Leu 
50 55 60 

ttc act gaa atg acg ccg ccg ccc aat acg gaa ggt ggt ggt ctg gcg 240 
Phe Thr Glu Met Thr Pro Pro Pro Asn Thr Glu Gly Gly Gly Leu Ala 
65 70 75 80 

aac get ctg gcg ggt age ggg ctg tta att ttg tgg gec acg gta ttc 288 
Asn Ala Leu Ala Gly Ser Gly Leu Leu He Leu Trp Ala Thr Val Phe 
85 90 95 

ggt acg ccg ctg ggc att atg gcg ggg att tat ctg gcg gaa tat ggt 336 
Gly Thr Pro Leu Gly He Met Ala Gly He Tyr Leu Ala Glu Tyr Gly 
100 105 110 

cgt aaa tec tgg ctg gca gaa gtg att cgc ttc att aac gac att ctg 384 
Arg Lys Ser Trp Leu Ala Glu Val He Arg Phe He Asn Asp He Leu 
115 120 125 

etc tct gcg ccg teg att gtg gtt ggt ctg ttt gtt tac acc att gtg 432 
Leu Ser Ala Pro Ser He Val Val Gly Leu Phe Val Tyr Thr He Val 
130 135 140 

gtg gcg cag atg gag cac ttc tec ggc tgg gcg ggc gtg att gee ctg 480 
Val Ala Gin Met Glu His Phe Ser Gly Trp Ala Gly Val He Ala Leu 
145 150 155 160 

, gcg ttg ttg cag gtg ccg att gtt ate cgc acc acc gag aac atg ctg 528 
Ala Leu Leu Gin Val Pro He Val He Arg Thr Thr Glu Asn Met Leu 
165 170 175 

aaa ctg gtg ccg tac age ctg cgt gaa gcg get tat gcg ctg ggt aca 576 
Lys Leu Val Pro Tyr Ser Leu Arg Glu Ala Ala Tyr Ala Leu Gly Thr 
180 185 190 

ccg aag tgg aag atg ate tct gcg att acg ctg aaa gcg teg gtg tec 624 
Pro Lys Trp Lys Met He Ser Ala He Thr Leu Lys Ala Ser Val Ser 
195 200 205 

ggg att atg acc ggt ate ctg ctg gcg att gee cgt att get ggt gaa 672 
Gly He Met Thr Gly He Leu Leu Ala He Ala Arg He Ala Gly Glu 
210 215 . 220 

acc gcg ccg ctg ctg ttt acc gcg etc tec aac cag ttc tgg age acg 720 
Thr Ala Pro Leu Leu Phe Thr Ala Leu Ser Asn Gin Phe Trp Ser Thr 
225 230 235 240 

gac atg atg cag ccg ate gee aac ctg ccg gtg acg ate ttt aag ttt 768 
Asp Met Met Gin Pro He Ala Asn Leu Pro Val Thr He Phe Lys Phe 
245 250 255 

gcg atg age ccg ttt gcg gaa tgg cag caa ttg gee tgg gee ggg gta 816 
Ala Met Ser Pro Phe Ala Glu Trp Gin Gin Leu Ala Trp Ala Gly Val 
260 265 270 
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ttg ate att acc ctg tgc gta ctg ctg ctg aac att ctg gcg cgc gtt 864 
Leu lie lie Thr Leu Cys Val Leu Leu Leu Asn lie Leu Ala Arg Val 
275 280 285 

gtt ttt gcg aag aat aaa cac ggt tga 891 
Val Phe Ala Lys Asn Lys His Gly * 
290 295 



<210> 186 
<211> 960 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (960) 

<400> 186 

atg get gca acc aag cct get ttt aac cca ccg ggt aaa aag ggc gac 48 
Met Ala Ala Thr Lys Pro Ala Phe Asn Pro Pro Gly Lys Lys Gly Asp 
15 10 15 

at a att ttc age gtg ctg gta aaa ctg gcg gcg ctg att gtg eta ttg 96 
lie lie Phe Ser Val Leu Val Lys Leu Ala Ala Leu He Val Leu Leu 
20 25 30 

atg ttg ggt ggc att att gtc tct ctg ate ate tec tec tgg ccg age 144 
Met Leu Gly Gly He He Val Ser Leu He He Ser Ser Trp Pro Ser 
35 40 45 

att cag aaa ttt ggt ctg get ttc eta tgg acc aaa gag tgg gat gca 192 
He Gin Lys Phe Gly Leu Ala Phe Leu Trp Thr Lys Glu Trp Asp Ala 
50 55 60 

ccg .aac gat ate tac ggg gcg ctg gtg ccg ate tac ggt acg ttg gtg 240 
Pro Asn Asp He Tyr Gly Ala Leu Val Pro He Tyr Gly Thr Leu Val 
65 70 75 80 

act teg ttt ate gcg ctg ctg ate gee gtc ccg gtg agt ttc ggt ate 288 
Thr Ser Phe He Ala Leu Leu He Ala Val Pro Val Ser Phe Gly He 
85 90 95 

gee ctg ttc ctg act gag ctt gcg cct ggc tgg ctg aaa cgc ccg ctg 336 
Ala Leu Phe Leu Thr Glu Leu Ala Pro Gly Trp Leu Lys Arg Pro Leu 
100 105 110 

ggt ate gee att gag ctg ctg gca gee att cca agt ate gtt tac ggc 384 
Gly He Ala He Glu Leu Leu Ala Ala He Pro Ser He Val Tyr Gly 
115 120 125 

atg tgg ggc ctg ttt ate ttt gcg ccg ctg ttc gec gtt tac ttt cag 432 
Met Trp Gly Leu Phe He Phe Ala Pro Leu Phe Ala Val Tyr Phe Gin 
130 135 140 

gag ccg gtc ggc aat ate atg teg aat ate ccg att gtt ggc gcg ctg 480 
Glu Pro Val Gly Asn He Met Ser Asn He Pro He Val Gly Ala Leu 
145 150 155 160 

ttc tct ggc ccc gca ttt ggt ate ggt ate etc gcg gca ggc gtg ate 528 
Phe Ser Gly Pro Ala Phe Gly He Gly He Leu Ala Ala Gly Val He 
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165 170 . 175 

etc gec ate atg att att ccg tac att gcg gcg gta atg cgt gat gtg 576 
Leu Ala He Met He He Pro Tyr He Ala Ala Val Met Arg Asp Val 
180 185 190 

ttc gaa caa acc ccg gtg atg atg aaa gag teg gec tac ggt att ggc 624 
Phe Glu Gin Thr Pro Val Met Met Lys Glu Ser Ala Tyr Gly He Gly 
195 200 205 

tgc acc acc tgg gaa gtt ate tgg cgt ate gtt ctt ccg ttc acc aaa 672 
Cys Thr Thr Trp Glu Val He Trp Arg He Val Leu Pro Phe Thr Lys 
210 215 220 

aat ggt gtt ate ggc ggc ate atg ctg ggg ctg ggc cgc gcg etc ggt 720 
Asn Gly Val He Gly Gly He Met Leu Gly Leu Gly Arg Ala Leu Gly 
225 ■ 230 235 240 

gaa acc atg gcg gtg acc ttt ate ate ggt aac acc tac cag etc gac 768 
Glu Thr Met Ala Val Thr Phe He He Gly Asn Thr Tyr Gin Leu Asp 
245 250 255 

age gee teg ctg tat atg ccg ggc aac agt ate acc tct gcg ctg gcg 816 
Ser Ala Ser Leu Tyr Met Pro Gly Asn Ser He Thr Ser Ala Leu Ala 
260 265 270 

aac gaa ttt gcg gaa gcg gaa tec ggt ctg cac gtt gec gca ctg atg 864 
Asn Glu Phe Ala Glu Ala Glu Ser Gly Leu His Val Ala Ala Leu Met 
275 280 285 

gaa ctg ggc ctg ate ctg ttt gtg att acc ttc ate gtc etc gec gca 912 
Glu Leu Gly Leu He Leu Phe Val He Thr Phe He Val Leu Ala Ala 
290 295 300 

teg aag ttt atg att atg cgc ctg get aag aat gag ggg gca cgc taa 960 
Ser Lys Phe Met He Met Arg Leu Ala Lys Asn Glu Gly Ala Arg * 
305 310 315 



<210> 187 
<211> 1041 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1041) 

<400> 187 

atg aaa gtt atg cgt acc acc gtc gca act gtt gtc gec gcg acc tta 48 

Met Lys Val Met Arg Thr Thr Val Ala Thr Val Val Ala Ala Thr Leu 

15 10 15 

teg atg agt get ttc tct gtg ttt gca gaa gca age ctg aca ggt gca 96 
Ser Met Ser Ala Phe Ser Val Phe Ala Glu Ala Ser Leu Thr Gly Ala 
20 25 30 

ggt gca acc ttc cct gcg ccg gtg tat gee aaa tgg get gac act tac 144 
Gly Ala Thr Phe Pro Ala Pro Val Tyr Ala Lys Trp Ala Asp Thr Tyr 
35 40 45 
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cag aaa gaa acc ggt aat aaa gtt aac tac cag ggt ate ggt tct tec 192 
Gin Lys Glu Thr Gly Asn Lys Val Asn Tyr Gin Gly lie Gly Ser Ser 
50 55 60 

ggt ggc gta aaa cag att ate get aat acc gtt gat ttt ggt gee tct 240 
Gly Gly Val Lys Gin He He Ala Asn Thr Val Asp Phe Gly Ala Ser 
65 70 75 80 

gac gcg ccg ctg tct gac gaa aaa ctg get cag gaa ggt ctg ttc cag 288 
Asp Ala Pro Leu Ser Asp Glu Lys Leu Ala Gin Glu Gly Leu Phe Gin 
85 90 95 

ttc ccg acc gtg att ggc ggc gtg gtg ctg gcg gtt aac att cca ggg 336 
Phe Pro Thr Val He Gly Gly Val Val Leu Ala Val Asn He Pro Gly 
100 105 110 

ctg aag tct ggc gaa ctg gtg ctg gat ggt aaa acc etc ggc gac ate 384 
Leu Lys Ser Gly Glu Leu Val Leu Asp Gly Lys Thr Leu Gly Asp He 
115 120 125 

tac ctg ggc aaa ate aag aag tgg gat gat gaa gec ate gee aaa ctg 432 
Tyr Leu Gly Lys He Lys Lys Trp Asp Asp Glu Ala He Ala Lys Leu 
130 135 140 

aat ccg ggt ctg aaa ctg cct tea caa aac att get gta gta cgc cgc 480 
Asn Pro Gly Leu Lys Leu Pro Ser Gin Asn He Ala Val Val Arg Arg 
145 ~ 150 155 160 

gca gat ggc tec ggg act tec ttc gtc ttc acc age tac ctg gcg aaa 528 
Ala Asp Gly Ser Gly Thr Ser Phe Val Phe Thr Ser Tyr Leu Ala Lys 
165 170 175 

gtg aac gaa gag tgg aaa aac aac gtt ggt act ggc tct acc gta aaa 576 
Val Asn Glu Glu Trp Lys Asn Asn Val Gly Thr Gly Ser Thr Val Lys 
180 185 190 

tgg ccg ate ggt ctg ggc ggt aaa ggt aac gac ggt ate gee gcg ttc 624 
Trp Pro He Gly Leu Gly Gly Lys Gly Asn Asp Gly He Ala Ala Phe 
195 * 200 205 

gtt cag cgt ctg ccg ggt gca att ggt tat gtt gaa tat get tac gcg 672 
Val Gin Arg Leu Pro Gly Ala He Gly Tyr Val Glu Tyr Ala Tyr Ala 
210 215 220 

aag cag aac aac ctg gcg tac acc aaa ctg ate tec get gat ggt aaa 720 
Lys Gin Asn Asn Leu Ala Tyr Thr Lys Leu He Ser Ala Asp Gly Lys 
225 230 235 240 

ccg gtt agt ccg acc gaa gaa aac ttc get aat gca gca aaa ggt gca 768 
Pro Val Ser Pro Thr Glu Glu Asn Phe Ala Asn Ala Ala Lys Gly Ala 
245 250 255 

gac tgg age aaa acc ttc get cag gat ctg acc aac cag aaa ggc gaa 816 
Asp Trp Ser Lys Thr Phe Ala Gin Asp Leu Thr Asn Gin Lys Gly Glu 
260 265 270 

gat gca tgg cct att acc tct acc acg ttc att ctg ate cac aaa gat 864 
Asp Ala Trp Pro He Thr Ser Thr Thr Phe He Leu He His Lys Asp 
275 280 285 
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cag aag aaa cca gaa caa ggc aca gaa gtg ctg aaa ttc ttc gac tgg 912 
Gin Lys Lys Pro Glu Gin Gly Thr Glu Val Leu Lys Phe Phe Asp Trp 
290 295 300 

gcg tac aaa acc ggg get aaa cag gcg aac gac ctg gat tac gec age 960 
Ala Tyr Lys Thr Gly Ala Lys Gin Ala Asn Asp Leu Asp Tyr Ala Ser 
305 310 315 320 

ctg ccg gat agt gta gtt gaa cag gtt cgc get gcg tgg aag acc aat 1008 
Leu Pro Asp Ser Val Val Glu Gin Val Arg Ala Ala Trp Lys Thr Asn 
325 330 335 

att aaa gac agt age ggt aag ccg ctg tac taa 1041 
lie Lys Asp Ser Ser Gly Lys Pro Leu Tyr * 
340 345 



<210> 188 
<211> 711 
c212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (711) 

<400> 188 

atg gga tct gga ctg gtg aac gga ggt gat tat ttt tat aat aac ctt 48 
Met Gly Ser Gly Leu Val Asn Gly Gly Asp Tyr Phe ^yr Asn Asn Leu 
15 10 15 

tea ttc aca gtc acc agg tac aac ggc ate atg gca act gac tea aca 96 
Ser Phe Thr Val Thr Arg Tyr Asn Gly lie Met Ala Thr Asp Ser Thr 
20 25 30 

caa tgt gta aaa aaa age cgt ggc cgc cca aaa gtg ttc gac agg gat 144 
Gin Cys Val Lys Lys Ser Arg Gly Arg Pro Lys Val Phe Asp Arg Asp 
35 40 45 

gee gcg ctt gat aag gee atg aaa ttg ttc tgg caa cac ggt tat gaa 192 
Ala Ala Leu Asp Lys Ala Met Lys Leu Phe Trp Gin His Gly Tyr Glu 
50 55 60 

gcg act tct ctt gcg gac etc gtc gaa gcg acc gga gee aaa gcg ccc 240 
Ala Thr Ser Leu Ala Asp Leu Val Glu Ala Thr Gly Ala Lys Ala Pro 
65 70 75 80 

acg eta tac gcg gaa ttt acc aac aaa gag ggg tta ttt cgt gee gtt 288 
Thr Leu Tyr Ala Glu Phe Thr Asn Lys Glu Gly Leu Phe Arg Ala Val 
85 90 95 

etc gac cgc tat ate gat cgt ttt gee get aag cat gaa gca cag ctg 336 
Leu Asp Arg Tyr He Asp Arg Phe Ala Ala Lys His Glu Ala Gin Leu 
100 105 110 

ttt tgt gaa gag aaa age gtg gag tct gcg ctg get gac tat ttt get 384 
Phe Cys Glu Glu Lys Ser Val Glu Ser Ala Leu Ala Asp Tyr Phe Ala 
115 120 125 

gee ate gee aac tgc ttt acc age aaa gac acc ccg gca ggc tgc ttc 432 
Ala He Ala Asn Cys Phe Thr Ser Lys Asp Thr Pro Ala Gly Cys Phe 
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130 135 140 

atg ate aac aac tgc acc acc etc tec cca gat tea gga gat ate gee - 480 
Met lie Asn Asn Cys Thr Thr Leu Ser Pro Asp Ser Gly Asp lie Ala 
145 • 150 155 160 

aat acg ttg aaa tea cgc cat gcg atg caa gag cgc act ttg cag cag 528 
Asn Thr Leu Lys Ser Arg His Ala Met Gin Glu Arg Thr Leu Gin Gin 
165 170 175 

ttt tta tgt caa cga caa gcg cgc ggg gaa ate ccg ccc cac tgt gac 576 
Phe Leu Cys Gin Arg Gin Ala Arg Gly Glu lie Pro Pro His Cys Asp 
180 185 190 

gtg aca cat ctg gca gaa ttc ctt aat tgt att att cag ggg atg teg 624 
Val Thr His Leu Ala Glu Phe Leu Asn Cys lie lie Gin Gly Met Ser 
195 200 205 

ate age gca cgc gaa ggt gca teg ctg gaa aaa ctg atg cag att gee 672 
He Ser Ala Arg Glu Gly Ala Ser Leu Glu Lys Leu Met Gin He Ala 
210 215 220 

gga acg act ttg cgt tta tgg ccc gaa ctg gtg aaa taa 711 
Gly Thr Thr Leu Arg Leu Trp Pro Glu Leu Val Lys * 
225 230 235 



<210> 189 
<211> 546 
<212> DMA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (546) 

<400> 189 

gtg cag gec aaa att gcg gca tea aat acg ggt gaa ctg gat gee ctg 48 
Met Gin Ala Lys He Ala Ala Ser Asn Thr Gly Glu Leu Asp Ala Leu 
1 5 10 15 

caa cag ctg gga ttc tec ctg gta gaa ggt gaa gtt gat ttg gcg eta 96 
Gin Gin Leu Gly Phe Ser Leu Val Glu Gly Glu Val Asp Leu Ala Leu 
20 25 30 

ccc gtg aac aat gec agt gat age ggt get gta gtg gca caa gag acc 144 
Pro Val Asn Asn Ala Ser Asp Ser Gly Ala Val Val Ala Gin Glu Thr 
35 40 45 

gat att ccc gca tta cgt cag tta gee age gec gca ttt gcg caa age 192 
Asp He Pro Ala Leu Arg Gin Leu Ala Ser Ala Ala Phe Ala Gin Ser 
50 55 ,60 

cgt ttt cgt gcg ccg tgg tat gcg cct gac gee age agt cgc ttt tat 240 
Arg Phe Arg Ala Pro Trp Tyr Ala Pro Asp Ala Ser Ser Arg Phe Tyr 
65 70 75 80 

gca cag tgg att gaa aat gee gtg cgc ggc acc ttt gat cat caa tgt 288 
Ala Gin Trp He Glu Asn Ala Val Arg Gly Thr Phe Asp His Gin Cys 
85 90 95 



-149- 



WO 01/48209 



PCT/US00/34419 



ctg att tta cgt gcg gcg tec ggc gat att cgc ggc tat gtc tct tta 336 
Leu lie Leu Arg Ala Ala Ser Gly Asp He Arg Gly Tyr Val Ser Leu 
100 105 110 

egg gaa etc aat gcg aca gat gcg cga att ggc ctg ctg get gga'cgc 384 
Arg Glu Leu Asn Ala Thr Asp Ala Arg He Gly Leu Leu Ala Gly Arg 
115 120 125 

ggt gca ggt get gag ctg atg caa acg gcg eta aac tgg gcg tat cgt 432 
Gly Ala Gly Ala Glu Leu Met Gin Thr Ala Leu Asn Trp Ala Tyr Arg 
130 135 140 

cgc ggt aaa aca act ttg egg gtg gcg ace caa atg ggc aac acc gec 480 
Arg Gly Lys Thr Thr Leu Arg Val Ala Thr Gin Met Gly Asn Thr Ala 
145 150 155 160 

gcg ctt aaa cga tac ata caa agt ggt gcg aat gta gaa age acc gcg 528 
Ala Leu Lys Arg Tyr He Gin Ser Gly Ala Asn Val Glu Ser Thr Ala 
165 170 175 

tac tgg tta tac agg tga 546 
Tyr Trp Leu Tyr Arg * 
180 



<210> 190 
<211> 1131 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (1) . . . (1131) 
<400> 190 

atg att cca ttt aac gca ccg ccg gtg gtg gga acc gaa etc gac tat 48 
Met He Pro Phe Asn Ala Pro Pro Val Val Gly Thr Glu Leu Asp Tyr 
15 10 15 

atg cag teg gca atg ggt age ggc aaa ctg tgt ggc gat ggc ggt ttt 96 
Met Gin Ser Ala Met Gly Ser Gly Lys Leu Cys Gly Asp Gly Gly Phe 
20 25 30 

acc cgt cgc tgc cag cag tgg ctg gag caa cgt ttt ggc age gee aaa 144 
Thr Arg Arg Cys Gin Gin Trp Leu Glu Gin Arg Phe Gly Ser Ala Lys 
35 40 45 

gtg tta ctg acg ccg tec tgc acc get teg ctg gag atg gcg gcg ctg 192 
Val Leu Leu Thr Pro Ser Cys Thr Ala Ser Leu Glu Met Ala Ala Leu 
50 55 60 

ctg etc gat ate cag cct ggc gat gaa gtg ate atg ccg age tac acc 240 
Leu Leu Asp He Gin Pro Gly Asp Glu Val He Met Pro Ser Tyr Thr 
65 70 75 80 

ttt gtc tec acc gee aat gee ttt gtg ctg cgt ggc gca aaa ate gtt 288 
Phe Val Ser Thr Ala Asn Ala Phe Val Leu Arg Gly Ala Lys He Val 
85 90 95 

ttt gtg gat gtt cgc ccg gac acc atg aac ate gac gaa acg ctg att 336 
Phe Val Asp Val Arg Pro Asp Thr Met Asn He Asp Glu Thr Leu He 
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100 105 HO 

gaa gcg gcg ate acc gac aaa a eg cgc gtt ate gtg ccg gtc cat tac 384 
Glu Ala Ala lie Thr Asp Lys Thr Arg Val lie Val Pro Val His Tyr 
115 120 125 

gcg ggt gtg gee tgc gaa atg gac acc att atg gcg ttg gcg aaa aag 432 
Ala Gly Val Ala Cys Glu Met Asp Thr lie Met Ala Leu Ala Lys Lys 
130 135 140 

cat aat ttg ttt gtg gta gaa gat gec get cag ggc gtg atg tec act 480 
His Asn Leu Phe Val Val Glu Asp Ala Ala Gin Gly Val Met Ser Thr 
145 150 155 160 

tac aaa ggg cgt gca ctg gga acc att ggt cat att ggc tgc ttt age 528 
Tyr Lys Gly Arg Ala Leu Gly Thr He Gly His He Gly Cys Phe Ser 
165 170 175 

ttc cat gaa acc aaa aac tac acg gcg ggc ggt gaa ggc ggc gcg acg 576 
Phe His Glu Thr Lys Asn Tyr Thr Ala Gly Gly Glu Gly Gly Ala Thr 
180 185 190 

ctg att aac gat aaa gcg tta ate gaa cga gec gag ate ate cgt gaa 624 
Leu He Asn Asp Lys Ala Leu He Glu Arg Ala Glu lie lie Arg Glu 
195 200 205 

aag ggc act aac cgc age cag ttc ttc cgt ggt cag gtc gat aaa tat 672 
Lys Gly Thr Asn Arg Ser Gin Phe Phe Arg Gly Gin Val Asp Lys Tyr 
210 215 220 

acc tgg cgc gat att ggc tec age tat ttg atg tec gat ctg caa get 720 
Thr Trp Arg Asp He Gly Ser Ser Tyr Leu Met Ser Asp Leu Gin Ala 
225 230 235 240 

gca tac ctg tgg gcg caa ctg gaa gca gcg gat cgt ate aac cag caa 768 
Ala Tyr Leu Trp Ala Gin Leu Glu Ala Ala Asp Arg He Asn Gin Gin 
245 250 255 

cgt ctg gcg ctg tgg caa aac tac tac gat gcg tta gcg cct ctg gcg 816 
Arg Leu Ala Leu Trp Gin Asn Tyr Tyr Asp Ala Leu Ala Pro Leu Ala 
260 265 270 

aaa gee ggg cgt ate gag ctg ccg teg att ccc gat ggc tgc gtg cag 864 
Lys Ala Gly Arg He Glu Leu Pro Ser He Pro Asp Gly Cys Val Gin 
275 280 285 

aac gcg cat atg ttc tac att aaa ctg egg gat att gat gac egg age 912 
Asn Ala His Met Phe Tyr He Lys Leu Arg Asp He Asp Asp Arg Ser 
290 295 300 

gcg ttg att aac ttt ctg aaa gaa gcg gaa ate atg gcg gtg ttt cat 960 
Ala Leu He Asn Phe Leu Lys Glu Ala Glu He Met Ala Val Phe His 
305 310 315 320 



tac att ccg ctg cac ggt tgc cct gcg ggg gaa cac' ttt ggt. gag ttc 
Tyr He Pro Leu His Gly Cys Pro Ala Gly Glu His Phe Gly Glu Phe 
325 330 335 



1008 



cac ggt gaa gat cgc tac acc acc aaa gag age gag cgc ctg ctg cgc 1056 
His Gly Glu Asp Arg Tyr Thr Thr Lys Glu Ser Glu Arg Leu Leu Arg 
340 345 350 
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ctg ccg ctg ttc tac aac ctg teg ccc gtc aat cag cgt acg gta att 1104 
Leu Pro Leu Phe Tyr Asn Leu Ser Pro Val Asn Gin Arg Thr Val lie 
355 360 365 

gcg act ttg ttg aac tac ttt tec tga 1131 
Ala Thr Leu Leu Asn Tyr Phe Ser * 
370 375 



<210> 191 
<211> 1251 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1251) 

<400> 191 

atg teg ttg gca aaa gcg tec ttg tgg acg gcg gee agt aca ctg gtc 48 

Met Ser Leu Ala Lys Ala Ser Leu Trp Thr Ala Ala Ser Thr Leu Val 
1 5 10 15 

aag att ggt gee ggg tta ctg gtc ggt aag ttg ctg gcg gtg tea ttt 96 
Lys lie Gly Ala Gly Leu Leu Val Gly Lys Leu Leu Ala Val Ser Phe 
20 25 30 

ggt ccg gcg ggg ctt ggg ctg gcg gca aat ttc cgc cag ttg att ace 144 
Gly Pro Ala Gly Leu Gly Leu Ala Ala Asn Phe Arg Gin Leu lie Thr 
35 40 45 

gtg etc ggc gtg ctt gee ggg get ggc ate ttt aac ggt gta ace aaa 192 
Val Leu Gly Val Leu Ala Gly Ala Gly lie Phe Asn Gly Val Thr Lys 
50 55 t 60 

tac gtt gee cag tac cat gat aat ccg caa cag ctg cgc cgc gtg gtc 240 
Tyr Val Ala Gin Tyr His Asp Asn Pro Gin Gin Leu Arg Arg Val Val 
65 70 75 80 

ggc act tea tea gcg atg gta ctt ggt ttc tct acg ctg atg gcg ctg 288 
Gly Thr Ser Ser Ala Met Val Leu Gly Phe Ser Thr Leu Met Ala Leu 
85 90 95 

gtt ttt gtg ctg gca get gcg cca ate age cag gga ttg ttt ggt aat 336 
Val Phe Val Leu Ala Ala Ala Pro He Ser Gin Gly Leu Phe Gly Asn 
100 105 110 

ace gac tat cag ggg ctg gtg cgt tta gtg gcg ctg gtg caa atg ggg 384 
Thr Asp Tyr Gin Gly Leu Val Arg Leu Val Ala Leu Val Gin Met Gly 
115 120 125 

ate gec tgg ggc aac ctg tta ctg gcg ctg atg aaa ggc ttt cgc gat 432 
He Ala Trp Gly Asn Leu Leu Leu Ala Leu Met Lys Gly Phe Arg Asp 
130 135 140 

gee gca ggt aat gcg tta teg ctg att gtc ggc age ttg att ggc gtt 480 
Ala Ala Gly Asn Ala Leu Ser Leu He Val Gly Ser Leu He Gly Val 
145 150 155 160 

etc gcg tac tac gtc agt tac cgt ttg ggc ggt tat gaa ggg gcg ttg 528 
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Leu Ala Tyr Tyr Val Ser Tyr Arg Leu Gly Gly Tyr Glu Gly Ala Leu 
165 170 175 

ctg ggt ctg gcg ctg att ccc gcg ctg gtg gta att cct gcc gcc ate 576 
Leu Gly Leu Ala Leu lie Pro Ala Leu Val Val lie Pro Ala Ala lie 
180 185 190 

atg ttg ate aaa cgt ggt gtc ate ccg tta age tat ctg aaa ccc age 624 
Met Leu lie Lys Arg Gly Val lie Pro Leu Ser Tyr Leu Lys Pro Ser 
195 200 205 

tgg gat aac ggt ctg gca ggg cag ttg age aaa ttt acg etc atg gcg 672 
Trp Asp Asn Gly Leu Ala Gly Gin Leu Ser Lys Phe Thr Leu Met Ala 
210 215 220 

ttg att acg teg gtg acc ttg cct gtt get tac ate atg atg cgt aaa 720 
Leu lie Thr Ser Val Thr Leu Pro Val Ala Tyr lie Met Met Arg Lys 
225 230 235 240 

ctg ctg gcg gcg cag tat age tgg gat gag gtg ggg ate tgg caa ggg 768 
Leu Leu Ala Ala Gin Tyr Ser Trp Asp Glu Val Gly He Trp Gin Gly 
245 250 255 

gtg age agt att tec gat gcc tac ctg caa ttt att acg gca teg ttc 816 
Val Ser Ser He Ser Asp Ala Tyr Leu Gin Phe lie Thr Ala Ser Phe 
260 265 270 

age gta tat ttg ctg ccc acg ttg teg egg eta acg gaa aag cgc gat 864 
Ser Val Tyr Leu Leu Pro Thr Leu Ser Arg Leu Thr Glu Lys Arg Asp 
275 280 285 

ate acc egg gaa gtg gtt aaa teg ctg aaa ttc gtc tta ccg gca gtg 912 
He Thr Arg Glu Val Val Lys Ser Leu Lys Phe Val Leu Pro Ala Val 
290 295 300 

gcg gcg gcg agt ttt acc gtc tgg ctg ctg cgt gat ttt get ate tgg 960 
Ala Ala Ala Ser Phe Thr Val Trp Leu Leu Arg Asp Phe Ala He Trp 
305 310 315 320 

ctg ctg ttg teg aat aaa ttt acc get atg cgc gat etc ttt gcc tgg 1008 
Leu Leu Leu Ser Asn Lys Phe Thr Ala Met Arg Asp Leu Phe Ala Trp' 
325 330 335 

cag tta gtg ggt gat gtg tta aaa gtg ggc get tat gtc ttt ggt tat 1056 
Gin Leu Val Gly Asp Val Leu Lys Val Gly Ala Tyr Val Phe Gly Tyr 
340 345 350 

ctg gtg ate gcc aaa gcg tea ctg egg ttt tat att ctg gcg gaa gtc 1104 
Leu Val He Ala Lys Ala Ser Leu Arg Phe Tyr He Leu Ala Glu Val 
355 360 365 

age cag ttc act tta ttg atg gta ttt gcc cac tgg eta ate cct gcg 1152 
Ser Gin Phe Thr Leu Leu Met Val Phe Ala His Trp Leu He Pro Ala 
370 375 380 

cat ggt gca ctg ggc gcg gcg cag gca tat atg gca act tat ate gtc 1200 
His Gly Ala Leu Gly Ala Ala Gin Ala Tyr Met Ala Thr Tyr He Val 
385 390 395 400 

tat ttt tct ctt tgt tgt ggc gtg ttt tta etc tgg cgt agg egg gca 1248 
Tyr Phe Ser Leu Cys Cys Gly Val Phe Leu Leu Trp Arg Arg Arg Ala 
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405 410 415 

tga 1251 



<210> 192 
<211> 1353 
<212> DMA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1353) 

<400> 192 

atg agt ctg ctg caa ttc agt ggc ctg ttt gtt gtc tgg ctg etc tgc 48 

Met Ser Leu Leu Gin Phe Ser Gly Leu Phe Val Val Trp Leu Leu Cys 
15 10 15 

acg ctg ttt att gec acg ctg acc tgg ttt gag ttt cgc cgt gtg cgc 96 
Thr Leu Phe lie Ala Thr Leu Thr Trp Phe Glu Phe Arg Arg Val Arg 
20 25 30 

ttt aac ttc aat gtc ttc ttt tea ttg ctg ttt ttg etc acc ttt ttc 144 
Phe Asn Phe Asn Val Phe Phe Ser Leu Leu Phe Leu Leu Thr Phe Phe 
35 40 45 

ttc ggc ttc ccg ctg acc age gtg ctg gta ttt cgc ttt gat gtt ggt 192 
Phe Gly Phe Pro Leu Thr Ser Val Leu Val Phe Arg Phe Asp Val Gly 
50 55 60 

gtc gcg ccg cca gaa ate ttg ttg cag gcg ttg ctt tct gcg ggc tgc 240 
Val Ala Pro Pro Glu lie Leu Leu Gin Ala Leu Leu Ser Ala Gly Cys 
65 70 75 80 

ttc tac gcg gtt tac tat gtc acc tac aaa acc cgc eta egc aaa cgc 288 
Phe Tyr Ala Val Tyr Tyr Val Thr Tyr Lys Thr Arg Leu Arg Lys Arg 
85 90 95 

gtt get gat gta ccg cgc cgt ccg ctg ttt acc atg aac cgc gtg gag 336 
Val Ala Asp Val Pro Arg Arg Pro Leu Phe Thr Met Asa Arg Val Glu 
100 105 110 

acc aat ctt acg tgg gtg ate ctg atg ggt ate gcg ctg gta age gtc 384 
Thr Asn Leu Thr Trp Val lie Leu Met Gly lie Ala Leu Val Ser Val 
115 120 125 

ggc ate ttc ttc atg cac aac ggc ttt ttg ctg ttc egg ctt aac tec 432 
Gly lie Phe Phe Met His Asn Gly Phe Leu Leu Phe Arg Leu Asn Ser 
130 135 140 

tac agt cag ate ttt tec agt gaa gtc tec ggc gtg gcg tta aaa cgc 480 
Tyr Ser Gin lie Phe Ser Ser Glu Val Ser Gly Val Ala Leu Lys Arg 
145 150 155 160 

ttc ttt tac ttt ttc ate ccg gcg atg ctg gtg gtc tac ttt ctg cgc 528 
Phe Phe Tyr Phe Phe lie Pro Ala Met Leu Val Val Tyr Phe Leu Arg 
165 170 175 
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cag gac age aaa gcg tgg ctg ttt ttc etc gtc age acg gtc gee ttt 576 
Gin Asp Ser Lys Ala Trp Leu Phe Phe Leu Val Ser Thr Val Ala Phe 
180 185 190 

ggc ttg ctg act tat atg att gtc ggc ggc act cgc gee aat ate ate 624 
Gly Leu Leu Thr Tyr Met lie Val Gly Gly Thr Arg Ala Asn lie lie 
195 200 205 

ate gca ttc get ate ttc ctg ttt att ggc att att cgc ggc tgg att 672 
lie Ala Phe Ala lie Phe Leu Phe lie Gly lie lie Arg Gly Trp lie 
210 215 220 

teg ttg tgg atg ctg gcg gcg gcg ggc gtg ctg ggg att gtt ggc atg 720 
Ser Leu Trp Met Leu Ala Ala Ala Gly Val Leu Gly lie Val Gly Met 
225 230 235 240 

ttc tgg ctg gca eta aaa cgc tat gga atg aat gtg age ggc gat gaa 768 
Phe Trp Leu Ala Leu Lys Arg Tyr Gly Met Asn Val Ser Gly Asp Glu 
245 250 255 

gcg ttc tat ' acg ttt etc tat etc act cgc gac ace ttc teg ccg tgg 816 
Ala Phe Tyr Thr Phe Leu Tyr Leu Thr Arg Asp Thr Phe Ser Pro Trp 
260 265 270 

gag aat ctg gcg ttg ctg ttg cag aac tac gac aac ate gac ttc cag 864 
Glu Asn Leu Ala Leu Leu Leu Gin Asn Tyr Asp Asn lie Asp Phe Gin 
275 280 285 

ggc ctg get cca att gtc cgc gat ttc tat gtc ttt ate cct tec tgg 912 
Gly Leu Ala Pro He Val Arg Asp Phe Tyr Val Phe He Pro Ser Trp 
290 295 300 

ctg tgg ccg ggt cgc ccg agt atg gtg ctg aac tea gee aac tac ttt 960 
Leu Trp Pro Gly Arg Pro Ser Met Val Leu Asn Ser Ala Asn Tyr Phe 
305 310 315 320 

ace tgg gaa gtg ctg aat aac cac tec gga ctg gcg ate teg cct acg 1008 
Thr Trp Glu Val Leu Asn Asn His Ser Gly Leu Ala He Ser Pro Thr 
325 330 335 

ctt ata ggc tea ctg gtg gtg atg ggc ggc gcg ttg ttc ate ccg etc 1056 
Leu He Gly Ser Leu Val Val Met Gly Gly Ala Leu Phe He Pro Leu 
340 345 350 

ggg gcg ate gtg gtt ggt ctg ate ate aaa tgg ttc gac tgg ctg tat 1104 
Gly Ala He Val Val Gly Leu He He Lys Trp Phe Asp Trp Leu Tyr 
355 360 365 

gag ctg ggc aac cgc gag cct aat cgc tat aaa get gcg ata ttg cac 1152 
Glu Leu Gly Asn Arg Glu Pro Asn Arg Tyr Lys Ala Ala He Leu His 
370 375 380 

agt ttc tgc ttt ggg gcg ate ttc aat atg ate gtg ctg gcg cgt gaa 1200 
Ser Phe Cys Phe Gly Ala He Phe Asn Met He Val Leu Ala Arg Glu 
385 390 ,395 400 

ggg ctg gat teg ttt gtc tea cgc gtg gtc ttt ttt ate gtg gtc ttc 1248 
Gly Leu Asp Ser Phe Val Ser Arg Val Val Phe Phe He Val Val Phe 
405 410 415 

ggc gca tgt ctg atg ate gca aaa ctg ttg tac tgg ctt ttt gaa age 1296 
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Gly Ala Cys Leu Met lie Ala Lys t Leu Leu Tyr Trp Leu Phe Glu Ser 
420 425 \ 430 

gee gga etc att cat aaa cgt aca aaa tea teg etc egg acg cag gtt 1344 
Ala Gly Leu lie His Lys Arg Thr Lys Ser Ser Leu Arg Thr Gin Val 
435 440 445 

gaa gga taa 1353 
Glu Gly * 
450 



<210> 193 
<211> 741 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (741) 

<400> 193 

atg aat aac aac acc acg gca cca acc tat acg ctg cgt ggc tta cag 48 

Met Asn Asn Asn Thr Thr Ala Pro Thr Tyr Thr Leu Arg Gly Leu Gin 

15 10 15 

ttg att ggt tgg cgt gat atg cag cac gee etc gat tat ctg ttt get 96 
Leu lie Gly Trp Arg Asp Met Gin His Ala Leu Asp Tyr Leu Phe Ala 
20 25 30 

gac ggg cag ctt aag cag gga acg ctg gtt gec att aat get gaa aaa 144 
Asp Gly Gin Leu Lys Gin Gly Thr Leu Val Ala lie Asn Ala Glu Lys 
35 40 45 

atg ctg act att gaa gat aac gec gag gtc agg gag tta att aac get 192 
Met Leu Thr lie Glu Asp Asn Ala Glu Val Arg Glu Leu lie Asn Ala 
50 55 60 

gec gaa ttt aaa tat gcg gat ggc ate age gtt gta cgt tea gta cgt 240 
Ala Glu Phe Lys Tyr Ala Asp Gly lie Ser Val Val Arg Ser Val Arg 
65 70 75 80 

aaa aag tac ccg cag gcg cag gtt tec cgc gtt gee ggt gee gat etc 288 
Lys Lys Tyr Pro Gin Ala Gin Val Ser Arg Val Ala Gly Ala Asp Leu 
85 90 95 

tgg gaa gag ctg atg gcg cgc gca ggc aaa gaa ggg acg ccg gta ttt 336 
Trp Glu Glu lieu Met Ala Arg Ala Gly Lys Glu Gly Thr Pro Val Phe 
100 105 110 

ctt gtg ggc ggt aaa cct gaa gtg ctg gcg caa act gaa get aaa ctg 384 
Leu Val Gly Gly Lys Pro Glu Val Leu Ala Gin Thr Glii Ala Lys Leu 
115 120 125 

cgc aac cag tgg aat gtg aat ate gtt ggc agt cag gat ggt tat ttt 432 
Arg Asn Gin Trp Asn Val Asn lie Val Gly Ser Gin Asp Gly Tyr Phe 
. 130 135 140 

aaa ccc gag cag cgt cag gcg ctg ttt gaa cgc att cat gee age ggt 480 
Lys Pro Glu Gin Arg Gin Ala Leu Phe Glu Arg He His Ala Ser Gly 
145 150 155 160 
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gcg caa ate gtc acc gtt gcg atg gga teg cca aag cag gag ate ate 528 
Ala Gin He Val Thr Val Ala Met Gly Ser Pro Lys Gin Glu He He 
165 170 175 

atg cgc gac tgc cgt ctg gta cat cca gat gcg ctg tat atg ggc gtt 576 
Met Arg Asp Cys Arg Leu Val His Pro Asp Ala Leu Tyr Met Gly Val 
180 185 190 

ggc ggg act tac gat gtt ttc acc ggt cac gta aaa cgc gca ccg aaa 624 
Gly Gly Thr Tyr Asp Val Phe Thr Gly His Val Lys Arg Ala Pro Lys 
, 195 200 205 

ate tgg caa acg ctg ggg ctg gag tgg etc tac cgc ctg ctt teg cag 672 
He Trp Gin Thr Leu Gly Leu Glu Trp Leu Tyr Arg Leu Leu Ser Gin 
210 215 220 

ccg age cgc att aag cgt cag ctt cgt ttg ctg cgt tat tta cgc tgg 720 
Pro Ser Arg He Lys Arg Gin Leu Arg Leu lieu Arg Tyr Leu Arg Trp 
225 230 235 240 

cac tac acc ggc aac eta tga 741 
His Tyr Thr Gly Asn Leu * 
245 



<210> 194 
<211> 225 
<212> DNA 

<213> Escherichia coli 

<220> 

<22l> CDS 

<222> (1) . . . (225) 

<400> 194 

atg act gta ctg att cac gta ctg gga teg gat ate cct cac cat aac 48 
Met Thr Val Leu He His Val Leu Gly Ser Asp He Pro His His Asn 
1 5 10 15 

cga acc gtt ttg egg ttt ttc aat gac gcg ctg gec gcg acg age gag 96 
Arg Thr Val Leu Arg Phe Phe Asn Asp Ala Leu Ala Ala Thr Ser Glu 
20 25 30 

cac gcg cgc gag ttt atg gtt gtt ggc aag gac gac ggc tta agt gat 144 
His Ala Arg Glu Phe Met Val Val Gly Lys Asp Asp Gly Leu Ser Asp 
35 40 45 

age tgt ccg gcg ctt tct gtg caa ttt ttc ccc tgg gaa aaa ate get 192 
Ser Cys Pro Ala Leu Ser Val Gin Phe Phe Pro Trp Glu Lys He Ala 
50 55 60 

ggc gga age ggt cat cgc gaa age aaa age taa 225 
Gly Gly Ser Gly His Arg Glu Ser Lys Ser * 
65 70 



<210> 195 
<211> 615 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> CDS 

<222> (1) . (615) 

<400> 195 

gtg egg ggc gaa ctg ctg ttc ttc ccg acg egg atg gac cct teg etc 48 
Met Arg Gly Glu Leu Leu Phe Phe Pro Thr Arg Met Asp Pro Ser Leu 
15 10 15 

aat acg atg gcg aac gat egg caa cgt gaa ggg aaa atg ace att ctg 96 
Asn Thr Met Ala Asn Asp Arg Gin Arg Glu Gly Lys Met Thr lie Leu 
20 25 30 

gtg ggg aac tec ggc gac cgc age aat gag cat att get gee ttg cgc 144 
Val Gly Asn Ser Gly Asp Arg Ser Asn Glu His lie Ala Ala Leu Arg 
35 40 45 

gee gtt cat cag caa ttt ggc gat acg gta aaa gtg gtg gtg ccg atg 192 
Ala Val His Gin Gin Phe Gly Asp Thr Val Lys Val Val Val Pro Met 
50 55 60 

gga tat ccg cct aat aac gaa gcg tac att gag gaa gtt cgt cag gcg 240 
Gly Tyr Pro Pro Asn Asn Glu Ala Tyr lie Glu Glu Val Arg Gin Ala 
65 70 75 80 

ggg ctg gag tta ttc age gaa gaa aat eta caa att ctg age gaa aaa 288 
Gly Leu Glu Leu Phe Ser Glu Glu Asn Leu Gin He Leu Ser Glu Lys 
85 90 95 

ct!g gaa ttt gac gee tat ctg gcg eta ctt cgt cag tgc gat ctt ggt 336 
Leu Glu Phe Asp Ala Tyr Leu Ala Leu Leu Arg Gin Cys Asp Leu Gly 
100 105 110 

tac ttt att ttt gee cgc cag cag ggc att ggt acg ctg tgc tta ctg 384 
Tyr Phe lie Phe Ala Arg Gin Gin Gly He Gly Thr Leu Cys Leu Leu 
115 120 125 

att cag gcg ggc att cct tgt gtg ctt aac egg gaa aat ccg ttc tgg 432 
He Gin Ala Gly He Pro Cys Val Leu Asn Arg Glu Asn Pro Phe Trp 
130 135 140 

cag gat atg acg gaa caa cat ttg ccg gtg ctg ttt act ace gac gat 480 
Gin Asp Met Thr Glu Gin His Leu Pro Val Leu Phe Thr Thr Asp Asp 
145 150 155 160 

etc aac gag gat att gtg cgt gaa gcg cag cgc cag ttg gcg teg gtg 528 
Leu Asn Glu Asp He Val Arg Glu Ala Gin Arg Gin Leu Ala Ser Val 
165 170 175 

gat aaa aac acc att gee ttc ttt age cct aac tat eta caa ggc tgg 576 
Asp Lys Asn Thr He Ala Phe Phe Ser Pro Asn Tyr Leu Gin Gly Trp 
180 185 190 

cag egg gcg ttg gcg att gee gee agg gag gtc gca tga 615 
Gin Arg Ala Leu Ala He Ala Ala Arg Glu Val Ala * 
195 200 



<210> 196 
<211> 549 
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<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . (549) 

<400> 196 

atg att egg caa cgt cgt cgt gcg tta acg ccg gaa caa cag cag gaa 48 
Met lie Arg Gin Arg Arg Arg Ala Leu Thr Pro Glu Gin Gin Gin Glu 
15 10 15 

atg ggt caa caa gec get acc egg atg atg act tat ccc ccg gtg gtg 96 
Met Gly Gin Gin Ala Ala Thr Arg Met Met Thr Tyr Pro Pro Val Val 
20 25 30 

atg gca cat acg gtc get gta ttc etc tct ttt gat ggc gaa etc gac 144 
Met Ala His Thr Val Ala Val Phe Leu Ser Phe Asp Gly Glu Leu Asp 
35 40 45 

acc cag cca etc ata gaa caa etc tgg cgc gee ggt aag cgc gta tat 192 
Thr Gin Pro Leu lie Glu Gin Leu Trp Arg Ala Gly Lys Arg Val Tyr 
50 55 60 

ctt cca gtt ttg cat ccc ttt agt gee ggt aat ttg ctg ttc ctg aat 240 
Leu Pro Val Leu His Pro Phe Ser Ala Gly Asn Leu Leu Phe Leu Asn 
65 70 75 80 

tac cat ccg caa age gaa ctg gtg atg aac agg ttg aag ate cat gag 288 
Tyr His Pro Gin Ser Glu Leu Val Met Asn Arg Leu Lys lie His Glu 
85 90 95 

cca aaa ttg gat gtg cgt gac gtg eta ccc ctt tec cga tta gac gtg 336 
Pro Lys Leu Asp Val Arg Asp Val Leu Pro Leu Ser Arg Leu Asp Val 
100 105 110 

ctg ate aca ccg ctg gtc gee ttt gat gag tac ggt cag cgc ctg gga 384 
Leu lie Thr Pro Leu Val Ala Phe Asp Glu Tyr Gly Gin Arg Leu Gly 
115 120 125 

atg ggc ggt ggt ttt tat gat egg acc tta caa aac tgg cag cac tat 432 
Met Gly Gly Gly Phe Tyr Asp Arg Thr Leu Gin Asn Trp Gin His Tyr 
130 135 140 

aaa acg caa ccg gtg ggt tat gcg cat gat tgt cag ttg gtg gaa aaa 480 
Lys Thr Gin Pro Val Gly Tyr Ala His Asp Cys Gin Leu Val Glu Lys 
145 150 155 160 

etc ccc gtt gaa gag tgg gat ate cct ctt cct gcg gtg gtt aca ccg 528 
Leu Pro Val Glu Glu Trp Asp lie Pro Leu Pro Ala Val Val Thr Pro 
165 170 175 

teg aaa gtc tgg gag tgg taa 549 
Ser Lys Val Trp Glu Trp * 
180 



<210> 197 

<211> 276 

<212> DNA 

<213> Escherichia coli 
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<220> 

<221> CDS 

<222> (1) . . . (276) 

<400> 197 

atg gca cgc gta act gtt cag gac get gta gag aaa att ggt aac cgt 48 
Met Ala Arg Val Thr Val Gin Asp Ala Val Glu Lys lie Gly Asia Arg 
1 5 10 15 

ttt gac ctg gta ctg gtc gec gcg cgt cgc get cgt cag atg cag gta 96 
Phe Asp Leu Val Leu Val Ala Ala Arg Arg Ala Arg Gin Met Gin Val 
20 25 30 

ggc gga aag gat ccg ctg gta ccg gaa gaa aac gat aaa acc act gta 144 
Gly Gly Lys Asp Pro Leu Val Pro Glu Glu Asn Asp Lys Thr Thr Val 
35 40 45 

ate gcg ctg cgc gaa ate gaa gaa ggt ctg ate aac aac cag ate etc 192 
lie Ala Leu Arg Glu lie Glu Glu Gly Leu lie Asn Asn Gin lie Leu 
50 55 60 

gac gtt cgc gaa cgc cag gaa cag caa gag cag gaa gee get gaa tta 240 
Asp Val Arg Glu Arg Gin Glu Gin Gin Glu Gin Glu Ala Ala Glu Leu 
65 70 75 80 

caa gee gtt acc get att get gaa ggt cgt cgt taa 276 
Gin Ala Val Thr Ala lie Ala Glu Gly Arg Arg * 
85 90 



<210> 198 
<211> 2109 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (2109) 

<400> 198 

ttg tat ctg ttt gaa age ctg aat caa ctg att caa acc tac ctg ccg 48 

Met Tyr Leu Phe Glu Ser Leu Asn Gin Leu lie Gin Thr Tyr Leu Pro 

15 10 15 



gaa gac caa ate aag cgt ctg egg cag gcg tat etc gtt gca cgt gat 96 
Glu Asp Gin lie Lys Arg Leu Arg Gin Ala Tyr Leu Val Ala Arg Asp 
20 25 30 

get cac gag ggg caa aca cgt tea age ggt gaa ccc tat ate acg cac 144 
Ala His Glu Gly Gin Thr Arg Ser Ser Gly Glu Pro Tyr lie Thr His 
35 40 45 

ccg gta gcg gtt gee tgc att ctg gee gag atg aaa etc gac tat gaa 192 
Pro Val Ala Val Ala Cys lie Leu Ala Glu Met Lys Leu Asp Tyr Glu 
50 55 60 

acg ctg atg gcg gcg ctg ctg cat gac gtg att gaa gat act ccc gec 240 
Thr Leu Met Ala Ala Leu Leu His Asp Val lie Glu Asp Thr Pro Ala 
65 70 75 80 
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acc tac cag gat atg gaa cag ctt ttt ggt aaa age gtc gec gag ctg 288 
Thr Tyr Gin Asp Met Glu Gin Leu Phe Gly Lys Ser Val Ala Glu Leu 
85 90 95 

gta gag ggg gtg teg aaa ctt gat aaa etc aag ttc cgc gat aag aaa 336 
Val Glu Gly Val Ser Lys Leu Asp Lys Leu Lys Phe Arg Asp Lys Lys 
100 105 110 

gag gcg cag gee gaa aac ttt cgc aag atg att atg gcg atg gtg cag 384 
Glu Ala Gin Ala Glu Asn Phe Arg Lys Met lie Met Ala Met Val Gin 
115 120 125 

gat ate cgc gtc ate etc ate aaa ctt gee gac cgt acc cac aac atg 432 
Asp lie Arg Val lie Leu lie Lys Leu Ala Asp Arg Thr His Asn Met 
130 135 140 

cgc acg ctg ggc tea ctt cgc ccg gac aaa cgt cgc cgc ate gec cgt 480 
Arg Thr Leu Gly Ser Leu Arg Pro Asp Lys Arg Arg Arg lie Ala Arg 
145 150 155 160 

gaa act etc gaa att tat age ccg ctg gcg cac cgt tta ggt ate cac 528 
Glu Thr Leu Glu lie Tyr Ser Pro Leu Ala His Arg Leu Gly He His 
165 170 175 

cac att aaa acc gaa etc gaa gag ctg ggt ttt gag gcg ctg tat ccc 576 
His He Lys Thr Glu Leu Glu Glu Leu Gly Phe Glu Ala Leu Tyr Pro 
180 185 190 

aac cgt tat cgc gta ate aaa gaa gtg gtg aaa gec gcg cgc ggc aac 624 
Asn Arg Tyr Arg Val He Lys Glu Val Val Lys Ala Ala Arg Gly Asn 
155 200 205 

cgt aaa gag atg ate cag aag att ctt tct gaa ate gaa ggg cgt ttg 672 
Arg Lys Glu Met He Gin Lys He Leu Ser Glu He Glu Gly Arg Leu 
210 215 220 

cag gaa gcg gga at a ccg tgc cgc gtc agt ggt cgc gag aag cat ctt 720 
Gin Glu Ala Gly He Pro Cys Arg Val Ser Gly Arg Glu Lys His Leu 
225 230 235 240 

tat teg att tac tgc aaa atg gtg etc aaa gag cag cgt ttt cac teg 768 
Tyr Ser He Tyr Cys Lys Met Val Leu Lys Glu Gin Arg Phe His Ser 
245 250 255 

ate atg gac ate tac get ttc cgc gtg ate gtc aat gat tct gac acc ,816 
He Met Asp He Tyr Ala Phe Arg Val He Val Asn Asp Ser Asp Thr 
260 265 270 

tgt tat cgc gtg ctg ggc cag atg cac age ctg tac aag ccg cgt ccg 864 
Cys Tyr Arg Val Leu Gly Gin Met His Ser Leu Tyr Lys Pro Arg Pro 
275 280 285 

ggc cgc gtg aaa gac tat ate gec att cca aaa gcg aac ggc tat cag 912 
Gly Arg Val Lys Asp Tyr He Ala He Pro Lys Ala Asn Gly Tyr Gin 
290 295 300 

tct ttg cac acc teg atg ate ggc ccg cac ggt gtg ccg gtt gag gtc 960 
Ser Leu His Thr Ser Met He Gly Pro His Gly Val Pro Val Glu Val 
305 310 315 320 

cag ate cgt acc gaa gat atg gac cag atg gcg gag atg ggt gtt gec 1008 
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Gin lie Arg Thr Glu Asp Met Asp Gin Met Ala Glu Met Gly Val Ala 
325 330 335 

gcg cac tgg get tat aaa gag cac ggc gaa acc agt act acc gca caa 1056 
Ala His Trp Ala Tyr Lys Glu His Gly Glu Thr Ser Thr Thr Ala Gin 
340 345 350 

ate cgc gec cag cgc tgg atg caa age ctg ctg gag ctg caa cag age 1104 
lie Arg Ala Gin Arg Trp Met Gin Ser Leu Leu Glu Leu Gin Gin Ser 
355 360 365 

gee ggt agt teg ttt gaa ttt ate gag age gtt aaa tec gat etc ttc 1152 
Ala Gly Ser Ser Phe Glu Phe lie Glu Ser Val Lys Ser Asp Leu Phe 
370 375 380 

ccg gat gag att tac gtt ttc aca ccg gaa ggg cgc att gtc gag ctg 1200 
Pro Asp Glu lie Tyr Val Phe Thr Pro Glu Gly Arg lie Val Glu Leu 
385 390 395 400 

cct gee ggt gca acg ccc gtc gac ttc get tat gca gtg cat acc gat 1248 
Pro Ala Gly Ala Thr Pro Val Asp Phe Ala Tyr Ala Val His Thr Asp 
405 410 415 

ate ggt cat gec tgc gtg ggc gca cgc gtt gac cgc cag cct tac ccg 1296 
lie Gly His Ala Cys Val Gly Ala Arg Val Asp Arg Gin Pro Tyr Pro 
420 425 430 

ctg teg cag ccg ctt acc age ggt caa acc gtt gaa ate att acc get 1344 
lieu Ser Gin Pro Leu Thr Ser Gly Gin Thr Val Glu He He Thr Ala 
435 440 445 

ccg ggc get cgc ccg aat gee get tgg ctg aac ttt gtc gtt age teg 1392 
Pro Gly Ala Arg Pro Asn Ala Ala Trp Leu Asn Phe Val Val Ser Ser 
450 455 460 

aaa gcg cgc gec aaa att cgt cag ttg ctg aaa aac etc aag cgt gat 1440 
Lys Ala Arg Ala Lys He Arg Gin Leu Leu Lys Asn Leu Lys Arg Asp 
465 470 475 480 

gat tct gta age ctg ggc cgt cgt ctg etc aac cat get ttg ggt ggt 1488 
Asp Ser Val Ser Leu Gly Arg Arg Leu Leu Asn His Ala Leu Gly Gly 
485 490 495 

age cgt aag ctg aat gaa ate ccg cag gaa aat att cag cgc gag ctg 1536 
Ser Arg Lys Leu Asn Glu He Pro Gin Glu Asn He Gin Arg Glu Leu 
500 505 510 

gat cgc atg aag ctg gca acg ctt gac gat ctg ctg gca gaa ate gga 1584 
Asp Arg Met Lys Leu Ala Thr Leu Asp Asp Leu Leu Ala Glu He Gly 
515 520 525 

ctt ggt aac gca atg age gtg gtg gtc gcg aaa aat ctg caa cat ggg 1632 
Leu Gly Asn Ala Met Ser Val Val Val Ala Lys Asn Leu Gin His Gly 
530 535 540 

gac gee tec att cca ccg gca acc caa age cac gga cat ctg ccc att 1680 
Asp- Ala Ser He Pro Pro Ala Thr Gin Ser His Gly His Leu Pro He 
545 550 555 560 

aaa ggt gee gat ggc gtg ctg ate acc ttt gcg aaa tgc tgc cgc cct 1728 
Lys Gly Ala Asp Gly Val Leu He Thr Phe Ala Lys Cys Cys Arg Pro 



-162- 



WO 01/48209 PCT/US00/34419 



565 570 575 



att cct ggc gac ccg att ate gec cac gtc age ccc ggt aaa ggt ctg 1776 
lie Pro Gly Asp Pro lie lie Ala His Val Ser Pro Gly Lys Gly Leu 
580 585 590 

gtg ate cac cat gaa tec tgc cgt aat ate cgt ggc tac cag aaa gag 1824 
Val He His His Glu Ser Cys Arg Asn He Arg Gly Tyr Gin Lys Glu 
595 600 605 

cca gag aag ttt atg get gtg gaa tgg gat aaa gag acg gcg cag gag 1872 
Pro Glu Lys Phe Met Ala Val Glu Trp Asp Lys Glu Thr Ala Gin Glu 
610 615 620 

ttc ate ace gaa ate aag gtg gag atg ttc aat cat cag ggt gcg ctg 1920 
Phe He Thr Glu He Lys Val Glu Met Phe Asn His Gin Gly Ala Leu 
625 630 635 640 

gca aac ctg acg gcg gca att aac acc acg act teg aat att caa agt 1968 
Ala Asn Leu Thr Ala Ala He Asn Thr Thr Thr Ser Asn He Gin Ser 
645 650 655 

ttg aat acg gaa gag aaa gat ggt cgc gtc tac age gec ttt att cgt 2016 
Leu Asn Thr Glu Glu Lys Asp Gly Arg Val Tyr Ser Ala Phe He Arg 
660 665 670 

ctg acc get cgt gac cgt gtg cat ctg gcg aat ate atg cgc aaa ate 2064 
Leu Thr Ala Arg Asp Arg Val His Leu Ala Asn He Met Arg Lys He 
675 680 685 

cgc gtg atg cca gac gtg att aaa gtc acc cga aac cga aat taa 2109 
Arg Val Met Pro Asp Val He Lys Val Thr Arg Asn Arg Asn * 
690 695 700 



<210> 199 
<211> 690 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(690) 

<400> 199 

atg aac cca aca cgt tat gca cgc ate tgc gaa atg etc gec agg egg 48 
Met Asn Pro Thr Arg Tyr Ala Arg He Cys Glu Met Leu Ala Arg Arg 
1 5 10 15 



cag cct gat ctg acc gtc tgc atg gag cag gtc cac aaa cct cat aac 96 
Gin Pro Asp Leu Thr Val Cys Met Glu Gin Val His Lys Pro His Asn 
20 25 30 

gtt tct gcg att att cgt acc gca gat gec gtt ggc gta cat gaa gtt 144 
Val Ser Ala He He Arg Thr Ala Asp Ala Val Gly Val His Glu Val 
35 40 45 

cac gee gtc tgg cct ggt age cgc atg cgc acc atg get teg gca gcg 192 
His Ala Val Trp Pro Gly Ser Arg Met Arg Thr Met Ala Ser Ala Ala 
50 55 60 
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gcg ggt agt aac age tgg gta cag gtg aaa aca cac cgc acc att ggc 240 
Ala Gly Ser Asn Ser Trp Val Gin Val Lys Thr His Arg Thr lie Gly 
65 70 75 80 

gat gec gtc get cat etc aaa ggc cag ggc atg cag att ctg gca acc 288 
Asp Ala Val Ala His Leu Lys Gly Gin Gly Met Gin lie Leu Ala Thr 
85 90 95 

cat ctt tct gat aac get gtc gat ttc cgc gaa att gat tac act cgc 336 
His Leu Ser Asp Asn Ala Val Asp Phe Arg Glu He Asp Tyr Thr Arg 
100 105 110 

ccg acc tgc att ttg atg gga cag gag aaa acg ggc ate acg cag gaa 384 
Pro Thr Cys He Leu Met Gly Gin Glu Lys Thr Gly lie Thr Gin Glu 
115 120 125 

gca ttg gec ctg gcg gat cag gac ate ate att ccg atg ate ggc atg 432 
Ala Leu Ala Leu Ala Asp Gin Asp He He He Pro Met He Gly Met 
130 135 140 

gtg cag teg ctg aat gtt tec gtt gee tea gec etc att ctt tac gaa 480 
Val Gin Ser Leu Asn Val Ser Val Ala Ser Ala Leu He Leu Tyr Glu 
145 150 155 160 

gec cag cgt cag egg caa aat gca ggc atg tac ctg cgt gaa aac age 528 
Ala Gin Arg Gin Arg Gin Asn Ala Gly Met Tyr Leu Arg Glu Asn Ser 
165 170 175 

atg ttg ccg gaa gca gag caa caa cgc ctg ttg ttt gaa ggc ggc tat 576 
Met Leu Pro Glu Ala Glu Gin Gin Arg Leu Leu Phe Glu Gly Gly Tyr 
" 180 185 190 

CC ST gtg ctg gcg aaa gtc gca aaa cgc aaa ggc ctg cct tat ccc cac 624 
Pro Val Leu Ala Lys Val Ala Lys Arg Lys Gly Leu Pro Tyr Pro His 
195 200 205 

gtc aat cag caa ggc gag ate gaa get gat gee gac tgg tgg get act 672 
Val Asn Gin Gin Gly Glu He Glu Ala Asp Ala Asp Trp Trp Ala Thr 
210 215 220 

atg cag get gca ggg taa 690 

Met Gin Ala Ala Gly * 

225 



<210> 200 
<211> 2082 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (2082) 

<400> 200 

atg aaa ggt cgc ctg tta gat get gtc cca etc agt tec eta acg ggc 48 
Met Lys Gly Arg Leu Leu Asp Ala Val Pro Leu Ser Ser Leu Thr Gly 
15 10 15 

gtt ggc gca gca ctt agt aac aaa ctg gcg aaa ate aac ctg cat acc 96 
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Val Gly Ala Ala Leu Ser Asn Lys Leu Ala Lys He Asn Leu His Thr 
20 25 30 

gtg cag gat eta etc tta cac ctt ccc ctg cgc tac gaa gat cgc acc 144 
Val Gin Asp Leu Leu Leu His Leu Pro Leu Arg Tyr Glu Asp Arg Thr 
35 40 ^ 45 

cat etc tac ccc ate gga gaa eta ctg ccg ggc gtt tat gee acg gtg 192 
His Leu Tyr Pro He Gly Glu Leu Leu Pro Gly Val Tyr Ala Thr Val 
50 55 60 

gaa ggc gaa gtg ctg aac tgc aat ate tec ttc ggc ggt egg egg atg 240 
Glu Gly Glu Val Leu Asn Cys Asn He Ser Phe Gly Gly Arg Arg Met 
65 70 75 80 

atg acc tgc cag ate age gac ggt tec ggc ate etc acc atg cgc ttt 288 
Met Thr Cys Gin He Ser Asp Gly Ser Gly He Leu Thr Met Arg Phe 
85 90 95 

ttc aat ttc age gcg gca atg aaa aat age ctg gcg gcg ggc cgc cgt 336 
Phe Asn Phe Ser Ala Ala Met Lys Asn Ser Leu Ala Ala Gly Arg Arg 
100 105 110 

gta ctg get tat ggc gaa gca aag cgc ggt aaa tat ggt gcg gag atg 384 
Val Leu Ala Tyr Gly Glu Ala Lys Arg Gly Lys Tyr Gly Ala Glu Met 
115 120 125 

ate cac ccg gaa tac cgc gtg cag ggc gat etc age acg cca gaa tta 432 
He His Pro Glu Tyr Arg Val Gin Gly Asp Leu Ser Thr Pro Glu Leu 
130 135 140 

cag gaa acg etc acg ccg gtt tat cca aca acg gaa ggc gta aag cag 480 
Gin Glu Thr Leu Thr Pro Val Tyr Pro Thr Thr Glu Gly Val Lys Gin 
145 150 155 160 

gec acg ctg cgt aaa tta acc gac cag gcg ctg gat ctg etc gac acc 528 
Ala Thr Leu Arg Lys Leu Thr Asp Gin Ala Leu Asp Leu Leu Asp Thr 
165 170 175 

tgc gec att gaa gaa etc ctg ccg ccg gaa ctg tea caa gga atg atg 576 
Cys Ala He Glu Glu Leu Leu Pro Pro Glu Leu Ser Gin Gly Met Met 
180 185 190 

acg eta ccg gaa gcg ttg cgc act ttg cac cgc ccg cca ccg acg eta 624 
Thr Leu Pro Glu Ala Leu Arg Thr Leu His Arg Pro Pro Pro Thr Leu 
195 200 205 

cag ctt age gat ctg gaa acc ggg cag cat ccg gcg caa cgt cgt ctg 672 
Gin Leu Ser Asp Leu Glu Thr Gly Gin His Pro Ala Gin Arg Arg Leu 
210 215 220 

att ctg gaa gaa ctg ctg gcg cac aac etc age atg tta gec tta cgt 720 
He Leu Glu Glu Leu Leu Ala His Asn Leu Ser Met Leu Ala Leu Arg 
225 230 235 240 

gec gga gca cag cgt ttt cat gec cag ccg ctg age gee aat gac acg 768 
Ala Gly Ala Gin Arg Phe His Ala Gin Pro Leu Ser Ala Asn Asp Thr 
245 250 255 

ctg aaa aat aaa etc etc gee gee tta ccg ttc aag cca acg ggc gca 816 
Leu Lys Asn Lys Leu Leu Ala Ala Leu Pro Phe Lys Pro Thr Gly Ala 
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260 265 270 

cag gca cgc gta gtg gcg gag ate gag cgc gat atg gcg ctg gat gtg 864 
Gin Ala Arg Val Val Ala Glu He Glu Arg Asp Met Ala Leu Asp Val 
275 280 285 

ccg atg atg cgt ctg gtg cag ggc gat gta ggt tec ggt aaa acg ctg 912 
Pro Met Met Arg Leu Val Gin Gly Asp Val Gly Ser Gly Lys Thr Leu 
290 295 300 

gtc gec gec etc gec gcg ttg cgt gcg att gec cac ggc aaa cag gta 960 
Val Ala Ala Leu Ala Ala Leu Arg Ala He Ala His Gly Lys Gin Val 
305 310 315 320 

gca ttg atg gca cca acc gaa tta etc gee gag cag cac gec aat aac 1008 
Ala Leu Met Ala Pro Thr Glu Leu Leu Ala Glu Gin His Ala Asn Asn 
325 330 335 

ttc cgc aac tgg ttt gca ccg etc ggt ate gaa gtg ggc tgg etc gec 1056 
Phe Arg Asn Trp Phe Ala Pro Leu Gly He Glu Val Gly Trp Leu Ala 
340 345 350 

ggt aag cag aaa ggt aaa gca egg ctg gca cag cag gaa gec ate gec 1104 
Gly Lys Gin Lys Gly Lys Ala Arg Leu Ala Gin Gin Glu Ala He Ala 
355 360 365 

age ggt cag gtg cag atg att gtc ggt aca cac gee ate ttc cag gaa 1152 
Ser Gly Gin Val Gin Met He Val Gly Thr His Ala He Phe Gin Glu 
370 375 380 

cag gtg cag ttt aac ggc ctg gcg ctg gtg att ate gac gaa cag cat 1200 
Gin Val Gin Phe Asn Gly Leu Ala Leu Val He He Asp Glu Gin His 
385 390 395 400 

cgt ttt ggc gtg cat cag cgt ctg gca ttg tgg gag aaa ggc cag cag 1248 
Arg Phe Gly Val His Gin Arg Leu Ala Leu Trp Glu Lys Gly Gin Gin 
405 410 415 

cag ggc ttc cat ccg cat cag ttg ate atg acc gec acg ccg ate ccc 1296 
Gin Gly Phe His Pro His Gin Leu He Met Thr Ala Thr Pro He Pro 
420 425 430 

cgc acg ctg gca atg act gcg tat gec gat etc gat acc teg gtg ata 1344 
Arg Thr Leu Ala Met Thr Ala Tyr Ala Asp Leu Asp Thr Ser Val He 
435 440 445 

gat gag ctg ccg cca ggc cgc acg cca gtg act acg gtc get att cct 1392 
Asp Glu Leu Pro Pro Gly Arg Thr Pro Val Thr Thr Val Ala He Pro 
450 455 460 

gat acc cgc cgt acc gac ate att gac cgc gtg cac cac gec tgc ata 1440 
Asp Thr Arg Arg Thr Asp He He Asp Arg Val His His Ala Cys He 
465 ~ 470 475 480 

act gaa ggt cgt cag gca tac tgg gtt tgt acg ttg att gaa gag teg 1488 
Thr Glu Gly Arg Gin Ala Tyr Trp Val Cys Thr Leu He Glu Glu Ser 
485 490 495 

gaa ttg ctg gaa gcg cag gcg gcg gaa get acc tgg gaa gag ttg aaa 1536 
Glu Leu Leu Glu Ala Gin Ala Ala Glu Ala Thr Trp Glu Glu Leu Lys 
500 505 510 
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ctg gcg eta cca gag ttg aac gtt ggc ctg gta cac ggg egg atg aaa 1584 
Leu Ala Leu Pro Glu Leu Asn Val Gly Leu Val His Gly Arg Met Lys 
515 520 525 

cct gec gag aaa cag gcg gtg atg gcg teg ttt aaa caa ggt gag eta 1632 
Pro Ala Glu Lys Gin Ala Val Met Ala Ser Phe Lys Gin Gly Glu Leu 
530 535 540 

cac ctg ctg gtt gec aca acc gtt att gaa gtc ggc gtt gat gtg cct 1680 
His Leu Leu Val Ala Thr Thr Val lie Glu Val Gly Val Asp Val Pro 
545 550 555 560 

aac gec agt ctg atg att ate gaa aac ccg gag cgt ctg ggt ctg gcg 1728 
Asn Ala Ser Leu Met lie lie Glu Asn Pro Glu Arg Leu Gly Leu Ala 
565 570 575 

cag tta cac cag ctg cgc ggg cgc gta ggt cgt ggc gcg gtg get tct 1776 
Gin Leu His Gin Leu Arg Gly Arg Val Gly Arg Gly Ala Val Ala Ser 
580 585 590 

cac tgc gtg ctg etc tac aaa acg ccg ctt tct aaa acg gcg caa att 1824 
His Cys Val Leu Leu Tyr Lys Thr Pro Leu Ser Lys Thr Ala Gin He 
595 600 605 

cgc ctg caa gtg ctg cgc gac agt aac gac ggt ttt gtg att gcg caa 1872 
Arg Leu Gin Val Leu Arg Asp Ser Asn Asp Gly Phe Val He Ala Gin 
610 615 620 

aaa gat ctg gag att cgc ggc cct ggc gaa ttg tta ggc acg cgt cag 1920 
Lys Asp Leu Glu He Arg Gly Pro Gly Glu Leu Leu Gly Thr Arg Gin 
625 630 635 640 

acg ggt aat get gaa ttt aaa gtg gcg gat tta ctg cgc gat cag gcg 1968 
Thr Gly Asn Ala Glu Phe Lys Val Ala Asp Leu Leu Arg Asp Gin Ala 
645 650 655 

atg ate ccg gaa gtt cag cgc ctg gca cgc cat att cac gaa cgt tac 2016 
Met He Pro Glu Val Gin Arg Leu Ala Arg His He His Glu Arg Tyr 
660 665 670 

cca caa cag gca aaa gee ctg ata gaa cgc tgg atg ccg gag acg gaa 2064 
Pro Gin Gin Ala Lys Ala Leu He Glu Arg Trp Met Pro Glu Thr Glu 
675 680 685 

cgt tac teg aat gcg taa 2082 
Arg Tyr Ser Asn Ala * 
690 



<210> 201 
<211> 1287 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1287) 

<400> 201 

atg aaa acc tct ctg ttt aaa age ctt tac ttt cag gtc ctg aca gcg 48 
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Met Lys Thr Ser Leu Phe Lys Ser Leu Tyr Phe Gin Val Leu Thr Ala 
15 10 15 

ata gcc att ggt att etc ctt ggc cat ttc tat cct gaa ata ggc gag 96 
lie Ala lie Gly lie Leu Leu Gly His Phe Tyr Pro Glu lie Gly Glu 
20 25 30 

caa atg aaa ccg ctt ggc gac ggc ttc gtt aag etc att aag atg ate 144 
Gin Met Lys Pro Leu Gly Asp Gly Phe Val Lys Leu lie Lys Met lie 
35 40 45 

ate get cct gtc ate ttt tgt ace gtc gta acg ggc att gcg ggc atg 192 
lie Ala Pro Val lie Phe Cys Thr Val Val Thr Gly lie Ala Gly Met 
50 55 60 

gaa age atg aag gcg gtc ggt cgt acc ggc gca gtc gca ctg ctt tac 240 
Glu Ser Met Lys Ala Val Gly Arg Thr Gly Ala Val Ala Leu Leu Tyr 
65 70 75 80 

ttt gaa att gtc agt acc ate gcg ctg att att ggt ctt ate ate gtt 288 
Phe Glu lie Val Ser Thr lie Ala Leu He He Gly Leu He He Val 
85 90 95 

aac gtc gtg cag cct ggt gcc gga atg aac gtc gat ccg gca acg ctt 336 
Asn Val Val Gin Pro Gly Ala Gly Met Asn Val Asp Pro Ala Thr Leu 
100 105 110 

gat gcg aaa gcg gta gcg gtt tac gcc gat cag gcg aaa gac cag ggc 384 
Asp Ala Lys Ala Val Ala Val Tyr Ala Asp Gin Ala Lys Asp Gin Gly 
115 120 125 

att gtc gcc ttc att atg gat gtc ate ccg gcg age gtc att ggc gca 432 
He Val Ala Phe He Met Asp Val He Pro Ala Ser Val He Gly Ala 
130 135 140 

ttt gcc age ggt aac att ctg cag gtg ctg ctg ttt gcc gta ctg ttt 480 
Phe Ala Ser Gly Asn He Leu Gin Val Leu Leu Phe Ala Val Leu Phe 
145 150 155 160 

ggt ttt gcg etc cac cgt ctg ggc age aaa ggc caa ctg att ttt aac 528 
Gly Phe Ala Leu His Arg Leu Gly Ser Lys Gly Gin Leu He Phe Asn 
165 170 175 

gtc ate gaa agt ttc teg cag gtc ate ttc ggc ate ate aat atg ate 576 
Val He Glu Ser Phe Ser Gin Val He Phe Gly He He Asn Met He 
180 185 190 

atg cgt ctg gca cct att ggt gcg ttc ggg gca atg gcg ttt acc ate 624 
Met Arg Leu Ala Pro He Gly Ala Phe Gly Ala Met Ala Phe Thr He 
195 200 205 

ggt aaa tac ggc gtc ggc aca ctg gtg caa ctg ggg cag ctg att ate 672 
Gly Lys Tyr Gly Val Gly Thr Leu Val Gin Leu Gly Gin Leu He He 
210 215 220 

tgt ttc tac att acc tgt ate ctg ttt gtg gtg ctg gta ttg ggt tea 720 
Cys Phe Tyr He Thr Cys He Leu Phe Val Val Leu Val Leu Gly Ser 
225 230 235 240 

ate get aaa gcg act ggt ttc agt ate ttc aaa ttt ate cgc tac ate 768 
He Ala Lys Ala Thr Gly Phe Ser He Phe Lys Phe He Arg Tyr He 
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245 250 255 

cgt gaa gaa ctg ctg att gta ctg ggg act tea tct tec gag teg gcg 816 
Arg Glu Glu Leu Leu He Val Leu Gly Thr Ser Ser Ser Glu Ser Ala 
260 265 270 

ctg ccg cgt atg etc gac aag atg gag aaa etc ggc tgc cgt aaa teg 864 
Leu Pro Arg Met Leu Asp Lys Met Glu Lys Leu Gly Cys Arg Lys Ser 
275 280 285 

gtg gtg ggg ctg gtc ate ccg aca ggc tac teg ttt aac ctt gat ggc 912 
Val Val Gly Leu Val He Pro Thr Gly Tyr Ser Phe Asn Leu Asp Gly 
290 295 300 

aca teg ata tac ctg aca atg gcg gcg gtg ttt ate gee cag gee act 960 
Thr Ser lie Tyr Leu Thr Met Ala Ala Val Phe He Ala Gin Ala Thr 
305 310 315 320 

aac agt cag atg gat ate gtc cac caa ate acg ctg tta ate gtg ttg 1008 
Asn Ser Gin Met Asp He Val His Gin He Thr Leu Leu He Val Leu 
325 330 335 

ctg ctt tct tct aaa ggg gcg gca ggg gta acg ggt agt ggc ttt ate 1056 
Leu Leu Ser Ser Lys Gly Ala Ala Gly Val Thr Gly Ser Gly Phe He 
340 345 350 

gtg ctg gcg gcg acg etc tct gcg gtg ggc cat ttg ccg gta gcg ggt 1104 
Val Leu Ala Ala Thr Leu Ser Ala Val Gly His Leu Pro Val Ala Gly 
355 360 365 

ctg gcg ctg ate etc ggt ate gac cgc ttt atg tea gaa get cgt gcg 1152 
Leu Ala Leu He Leu Gly He Asp Arg Phe Met Ser Glu Ala Arg Ala 
370 375 380 

ctg act aac ctg gtc ggt aac ggc gta gcg acc att gtc gtt get aag 1200 
Leu Thr Asn Leu Val Gly Asn Gly Val Ala Thr He Val Val Ala Lys 
385 . 390 395 400 

tgg gtg aaa gaa ctg gac cac aaa aaa ctg gac gat gtg ctg aat aat 1248 
Trp Val Lys Glu Leu Asp His Lys Lys Leu Asp Asp Val Leu Asn Asn 
405 410 415 

cgt gcg ccg gat ggc aaa acg cac gaa tta tec tct taa 1287 
Arg Ala Pro Asp Gly Lys Thr His Glu Leu Ser Ser * 
420 425 



<210> 202 
<211> 1191 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> {!)... (1191) 
<400> 202 

gtg acc acc cga cag cat teg teg ttt get att gtt ttt ate ctt ggc 48 
Met Thr Thr Arg Gin His Ser Ser Phe Ala He Val Phe He Leu Gly 
15 10 15 
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ctg ctg gcc atg ttg atg ccg ctg teg att gat atg tat ctg ccc gcg 96 
Leu Leu Ala Met Leu Met Pro Leu Ser lie Asp Met Tyr Leu Pro Ala 
20 25 30 

eta ccg gta att tea gcg cag ttt ggc gta ccg gcg ggc agt acg cag 144 
Leu Pro Val He Ser Ala Gin Phe Gly Val Pro Ala Gly Ser Thr Gin 
35 40 45 

atg acc etc agt act tat att ctg ggc ttt gcg ttg ggg cag tta ate 192 
Met Thr Leu Ser Thr Tyr He Leu Gly Phe Ala Leu Gly Gin Leu He 
50 55 60 

tac ggg ccg atg gca gac age ttc ggg cgt aag ccg gtg gtg etc ggc 240 
Tyr Gly Pro Met Ala Asp Ser Phe Gly Arg Lys Pro Val Val Leu Gly 
65 70 75 80 

ggt acg ctg gtg ttt gcc gcc gcc gcg gtg gcg tgt gcg ttg gca aac 288 
Gly Thr Leu Val Phe Ala Ala Ala Ala Val Ala Cys Ala Leu Ala Asn 
85 90 95 

acc ate gat cag ctg att gtg atg cgt ttc ttc cac ggg ctg get gcg 336 
Thr He Asp Gin Leu He Val Met Arg Phe Phe His Gly Leu Ala Ala 
100 105 110 

get gcg gcc age gtg gtc att aac gcc ctg atg cgc gat att tac ccg 384 
Ala Ala Ala Ser Val Val He Asn Ala Leu Met Arg Asp He Tyr Pro 
115 120 125 

aaa gaa gag ttc teg egg atg atg teg ttt gtc atg ctg gtg aca acc 432 
Lys Glu Glu Phe Ser Arg Met Met Ser Phe Val Met Leu Val Thr Thr 
130 135 140 

att gca ccg- ctg atg gca ccg ata gtt ggc ggc tgg gtg ctg gtg tgg 480 
He Ala Pro Leu Met Ala Pro He Val Gly Gly Trp Val Leu Val Trp 
145 150 155 160 

ctg age tgg cat tac ate ttc tgg ate ctg gca tta gcg gcg att ctg 528 
Leu Ser Trp His Tyr He Phe Trp He Leu Ala Leu Ala Ala He Leu 
165 170 175 

get teg gca atg att ttc ttc ctg att aaa gaa acc tta cca ccg gag 576 
Ala Ser Ala Met He Phe Phe Leu He Lys Glu Thr Leu Pro Pro Glu 
180 185 190 

cgt cgt cag cca ttt cac att cgt acc act att ggt aac ttt gcg gcg 624 
Arg Arg Gin Pro Phe His He Arg Thr Thr He Gly Asn Phe Ala Ala 
195 200 205 

ctg ttc cgc cat aaa cgt gtc ctg age tac atg ctt gcc agt ggt ttc 672 
Leu Phe Arg His Lys Arg Val Leu Ser Tyr Met Leu Ala Ser Gly Phe 
210 215 220 

age ttt gcc ggg atg ttc tea ttc tta age gcc gga ccg ttt gtt tat 720 
Ser Phe Ala Gly Met Phe Ser Phe Leu Ser Ala Gly Pro Phe Val Tyr 
225 230 235 240 

att gaa att aac cac gtc gcg ccg gaa aac ttt ggt tat tac ttt gcg 768 
He Glu He Asn His Val Ala Pro Glu Asn Phe Gly Tyr Tyr Phe Ala 
245 250 255 

eta aac att gtt ttt ctg ttc gtg atg acc ate ttt aac age cgc ttc 816 
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Leu Asn He Val Phe Leu Plie Val Met Thr lie Phe Asn Ser Arg Plie 
260 265 270 

gtc cgc cgc atfc ggc gcg tta aat atg ttc cgc teg ggg ttg tgg ata 864 
Val Arg Arg He Gly Ala Leu Asn Met Phe Arg Ser Gly Leu Trp He 
275 280 285 

caa ttt att atg gca gcg tgg atg gtc ate agt gcg ctg ctg ggg ctg 912 
Gin Phe He Met Ala Ala Trp Met Val He Ser Ala Leu Leu Gly Leu 
290 295 300 

gga ttt tgg teg ctg gtg gtt ggc gtt gcg gcg ttt gtg ggc tgc gtg 960 
Gly Phe Trp Ser Leu Val Val Gly Val Ala Ala Phe Val Gly Cys Val 
305 310 315 320 

teg atg gtg tea tec aat gcg atg gcg gtc att ctt gat gag ttt ccc 1008 
Ser Met Val Ser Ser Asn Ala Met Ala Val He Leu Asp Glu Phe Pro 
325 330 335 

cat atg gcg gga acg gca tct teg ctg gca gga ace ttc cgt ttt ggc 1056 
His Met Ala Gly Thr Ala Ser Ser Leu Ala Gly Thr Phe Arg Phe Gly 
340 345 350 

ata ggg gca att gtt ggc gca ttg ctt tct ctt gcg ace ttt aac tct 1104 
He Gly Ala He Val Gly Ala Leu Leu Ser Leu Ala Thr Phe Asn Ser 
355 360 365 

gca tgg ccg atg att tgg tea att gca ttc tgc gca ace age tec att 1152 
Ala Trp Pro Met He Trp Ser He Ala Phe Cys Ala Thr Ser Ser He 
370 375 380 

etc ttc tgt ctg tac gec agt egg ccg aaa aaa egg tga 1191 
Leu Phe Cys Leu Tyr Ala Ser Arg Pro Lys Lys Arg * 
385 390 395 



<210> 203 
<211> 696 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (696) 

<400> 203 

atg cga ctt gat aaa ttt ate gca cag caa etc ggc gtt age cgt get 4B 
Met Arg Leu Asp Lys Phe He Ala Gin Gin Leu Gly Val Ser Arg Ala 
15 10 15 

att gee ggg cgt gaa ate cgc ggc aat cgt gtc ace gtc gat ggc gaa 96 
He Ala Gly Arg Glu He Arg Gly Asn Arg Val Thr Val Asp Gly Glu 
20 25 30 

ate gtc cgt aat gca gcg ttc aaa ctg ctt cct gaa cat gat gtc get 144 
He Val Arg Asn Ala Ala Phe Lys Leu Leu Pro Glu His Asp Val Ala 
35 40 45 

tac gat ggc aac ccg ctg gcg cag caa cac ggt cca cgt tac ttc atg 192 
Tyr Asp Gly Asn Pro Leu Ala Gin Gin His Gly Pro Arg Tyr Phe Met 
50 55 60 
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etc aat aag cct cag ggc tat gtt tgc tec acg gac gac cct gat cac 240 
Leu Asn Lys Pro Gin Gly Tyr Val Cys Ser Thr Asp Asp Pro Asp His 
65 70 75 80 

cca acg gtg etc tat ttt ctt gat gaa ccg gta gcg tgg aaa ctg cat 288 
Pro Thr Val Leu Tyr Phe Leu Asp Glu Pro Val Ala Trp Lys Leu His 
85 90 95 

gcg gcg ggg egg ttg gat att gat ace acc ggt ctg gtg ctg atg act 336 
Ala Ala Gly Arg Leu Asp lie Asp Thr Thr Gly Leu Val Leu Met Thr 
100 105 110 

gat gat ggt cag tgg teg cac cgc att act tct ccg cgc cat cat tgc 384 
Asp Asp Gly Gin Trp Ser His Arg lie Thr Ser Pro Arg His His Cys 
115 120 125 

gag aag acc tat ctg gtg aca ctg gaa tea cct gta get gac gat acg 432 
Glu Lys Thr Tyr Leu Val Thr Leu Glu Ser Pro Val Ala Asp Asp Thr 
130 135 140 

gca gag caa ttt get aaa ggc gtg cag ctg cat aac gaa aaa gat etc 480 
Ala Glu Gin Phe Ala Lys Gly Val Gin Leu His Asn Glu Lys Asp Leu 
145 150 155 160 

act aag cct gcg gtg ctg gaa gtg att acc cca acg cag gtt cgt ctg 528 
Thr Lys Pro Ala Val Leu Glu Val lie Thr Pro Thr Gin Val Arg Leu 
165 170 175 

acc ate age gaa ggg cgt tat cat cag gtg aaa cgc atg ttc gee gee 576 
Thr lie Ser Glu Gly Arg Tyr His Gin Val Lys Arg Met Phe Ala Ala 
180 185 190 

gtg ggt aac cac gtg gtt gag ctg cat cgt gaa cgt att ggc ggt att 624 
Val Gly Asn His Val Val Glu Leu His Arg Glu Arg lie Gly Gly lie 
195 200 205 

acg ctg gat get gat tta gee ccc ggt gaa tat cgt ccg tta act gaa 672 
Thr Leu Asp Ala Asp Leu Ala Pro Gly Glu Tyr Arg Pro Leu Thr Glu 
210 215 220 

gaa gaa att gee age gtc gtc taa 696 
Glu Glu lie Ala Ser Val Val * 
225 230 



<210> 204 
<211> 1329 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) ...(1329) 

<400> 204 

atg aaa aaa att gaa tgc get tgc aat ttt ctg atg gat aaa gat gcg 48 
Met Lys Lys lie Glu Cys Ala Cys Asn Phe Leu Met Asp Lys Asp Ala 
15 10 15 

cag ggg tat ate gac ctg tct gat ttg gat tta aca agt tgt cat ttt 96 
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Gin Gly Tyr lie Asp lieu Ser Asp Leu Asp Leu Thr Ser Cys His Phe 
20 25 30 

aaa ggt gac gtt ata teg aag gtg tct ttt tta tea tea aat eta caa 144 
Lys Gly Asp Val He Ser Lys Val Ser Phe Leu Ser Ser Asn Leu Gin 
35 40 45 

cat gta aca ttc gaa tgt aaa gaa att ggg gat tgc aat ttt act act 192 
His Val Thr Phe Glu Cys Lys Glu He Gly Asp Cys Asn Phe Thr Thr 
50 55 60 

gca ata gtt gat aat gtc ata ttt aga tgt cga cgt tta cac aat gtg 240 
Ala He Val Asp Asn Val He Phe Arg Cys Arg Arg Leu His Asn Val 
65 70 75 80 

att ttt ate aaa gcg agt ggt gaa tgt gtc gat ttc age aaa aat att 288 
He Phe He Lys Ala Ser Gly Glu Cys Val Asp Phe Ser Lys Asn He 
85 90 95 

ctt gat aca gtt gac ttc teg cag agt caa ctt ggt cat agt aat ttt 336 
Leu Asp Thr Val Asp Phe Ser Gin Ser Gin Leu Gly His Ser Asn Phe 
100 105 110 

cgc gaa tgt cag att aga aat tea aac ttc gat aat tgt tat ctt tac 384 
Arg Glu Cys Gin He Arg Asn Ser Asn Phe Asp Asn Cys Tyr Leu Tyr 
115 120 125 

get teg cac ttc ace aga gca gag ttt ctg tct gee aaa gaa ata tea 432 
Ala Ser His Phe Thr Arg Ala Glu Phe Leu Ser Ala Lys Glu He Ser 
130 135 140 

ttt att aaa teg aat ttg aca get gtt atg ttt gat tat gtg cga atg 480 
Phe He Lys Ser Asn Leu Thr Ala Val Met Phe Asp Tyr Val Arg Met 
145 150 155 160 

teg aca ggg aat ttt aaa gat tgc att aca gaa caa ttg gaa tta act 528 
Ser Thr Gly Asn Phe Lys Asp Cys He Thr Glu Gin Leu Glu Leu Thr 
165 170 175 

att gat tat tea gat ata ttt tgg aat gaa gat etc gat ggt tat ate 576 
He Asp Tyr Ser Asp He Phe Trp Asn Glu Asp Leu Asp Gly Tyr He 
180 185 190 

aat aac att ata aaa atg att gat aca ttg cca gat aat gca atg ata 624 
Asn Asn He He Lys Met He Asp Thr Leu Pro Asp Asn Ala Met He 
195 200 205 

ttg aaa tec gtt ctg gee gta aaa ctg gtg atg caa tta aaa ata ctt 672 
Leu Lys Ser Val Leu Ala Val Lys Leu Val Met Gin Leu Lys He Leu 
210 215 220 

aat att gtt aat aaa aac ttt att gag aat atg aag aaa ata ttt age 720 
Asn He Val Asn Lys Asn Phe He Glu Asn Met Lys Lys He Phe Ser 
225 230 235 240 

cat tgt cct tat ata aaa gat ccc att ata cgc agt tat ate cat tct 768 
His Cys Pro Tyr He Lys Asp Pro He He Arg Ser Tyr He His Ser 
245 250 255 

gat gaa gat aac aag ttc gat gat ttt atg cgt caa cat cga ttc agt 816 
Asp Glu Asp Asn Lys Phe Asp Asp Phe Met Arg Gin His Arg Phe Ser 
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260 265 270 

gag gtg aat ttc gat acc caa cag atg ate gat ttt att aac aga ttt 864 
Glu Val Asn Phe Asp Thr Gin Gin Met lie Asp Phe lie Asn Arg Phe 
275 280 285 

aat acg aat aaa tgg eta att gat aaa aat aac aat ttt ttt ate eaa 912 
Asn Thr Asn Lys Trp Leu lie Asp Lys Asn Asn Asn Phe Phe He Gin 
290 295 300 

ctt ate gat cag gee tta cga tea acg gat gat atg ate aaa gca aat 960 
Leu He Asp Gin Ala Leu Arg Ser Thr Asp Asp Met He Lys Ala Asn 
305 310 315 320 

gtt tgg cat ctt tat aaa gag tgg att cgt agt gat gat gtt tea cct 1008 
Val Trp His Leu Tyr Lys Glu Trp He Arg Ser Asp Asp Val Ser Pro 
325 330 335 

ata ttt ata gaa act gaa gat aat tta aga acc ttt aac acg aat gaa 1056 
He Phe He Glu Thr Glu Asp Asn Leu Arg Thr Phe Asn Thr Asn Glu 
340 345 350 

tta aca cga aac gat aat ate ttt ate ctg ttc tec tea gtc gat gat 1104 
Leu Thr Arg Asn Asp Asn He Phe He Leu Phe Ser Ser Val Asp Asp 
355 360 365 

ggg cca gtt atg gtg gta age tec cag cgc tta cat gat atg ttg aat 1152 
Gly Pro Val Met Val Val Ser Ser Gin Arg Leu His Asp Met Leu Asn 
370 375 380 

cct aca aaa gat acc aat tgg aat tec acg tat ate tac aaa tec aga 1200 
Pro Thr Lys Asp Thr Asn Trp Asn Ser Thr Tyr He Tyr Lys Ser Arg 
385 390 395 400 

cat gag atg ttg cct gtt aat ctt act cag gaa aca ctt ttc age tec 1248 
His Glu Met Leu Pro Val Asn Leu Thr Gin Glu Thr Leu Phe Ser Ser 
405 410 415 

aaa tct cat ggt aaa tat gcg ctt ttc cce att ttt act gcg agt tgg 1296 
Lys Ser His Gly Lys Tyr Ala Leu Phe Pro He Phe Thr Ala Ser Trp 
420 425 430 

cga get cat cgt ata atg aat aag ggt gtt taa 1329 
Arg Ala His Arg He Met Asn Lys Gly Val * 
435 440 



<210> 205 
<211> 717 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (717) 

<400> 205 

gtg gga cgt aaa tgg gee aat att gtt get aaa aaa acg get aaa gac 48 
Met Gly Arg Lys Trp Ala Asn He Val Ala Lys Lys Thr Ala Lys Asp 
15 10 15 
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ggt gca acg tct aaa att tat gca aaa ttc ggt gta gaa ate tat get 96 
Gly Ala Thr Ser Lys lie Tyr Ala Lys Phe Gly Val Glu He Tyr Ala 
20 25 30 

get get aaa caa ggt gaa ccc gat cca gaa tta aac aca tct tta aaa 144 
Ala Ala Lys Gin Gly Glu Pro Asp Pro Glu Leu Asn Thr Ser Leu Lys 
35 40 45 

ttc gtt att gaa cgt gca aag cag gca caa gtt cca aag cac gtt att 192 
Phe Val He Glu Arg Ala Lys Gin Ala Gin Val Pro Lys His Val He 
50 55 60 

gat aaa gca att gat aaa gec aaa ggc ggc gga gat gaa acg ttc gtg 240 
Asp Lys Ala He Asp Lys Ala Lys Gly Gly Gly Asp Glu Thr Phe Val 
65 70 75 80 

cag gga cgt tat gaa ggc ttt ggt cct aat ggc tea atg att ate gec 288 
Gin Gly Arg Tyr Glu Gly Phe Gly Pro Asn Gly Ser Met He He Ala 
85 90 95 

gag aca ttg act tea aat gtt aac cgt acg att get aac gtt cgc aca 336 
Glu Thr Leu Thr Ser Asn Val Asn Arg Thr He Ala Asn Val Arg Thr 
X00 105 110 

att ttc aat aaa aaa ggc ggc aat ate gga gcg gca ggt tct gtc age 384 
He Phe Asn Lys Lys Gly Gly Asn He Gly Ala Ala Gly Ser Val Ser 
115 120 125 

tat atg ttt gac aat acg ggt gtg att gta ttt aaa ggg aca gac cct 432 
Tyr Met Phe Asp Asn Thr Gly Val He Val Phe Lys Gly Thr Asp Pro 
130 135 140 

gac cat att ttt gaa att tta ctt gaa get gaa gtt gat gtt cgt gat 480 
Asp His He Phe Glu He Leu Leu Glu Ala Glu Val Asp Val Arg Asp 
145 150 155 160 

gtg act gaa gaa gaa ggt aac att gtt att tat act gaa cct act gac 528 
Val Thr Glu Glu Glu Gly Asn He Val He Tyr Thr Glu Pro Thr Asp 
165 170 175 

ctt cat aaa gga ate gcg get eta aaa gca get gga ate act gag ttc 576 
Leu His Lys Gly He Ala Ala Leu Lys Ala Ala Gly He Thr Glu Phe 
180 185 190 

tea aca aca gaa tta gaa atg att get caa tct gaa gtt gag ctt tec 624 
Ser Thr Thr Glu Leu Glu Met He Ala Gin Ser Glu Val Glu Leu Ser 
195 200 205 

cca gaa gat tta gaa ate ttt gaa ggg ctt gtt gat gec ctt gaa gat 672 
Pro Glu Asp Leu Glu He Phe Glu Gly Leu Val Asp Ala' Leu Glu Asp 
210 215 220 

gac gac gat gta caa aaa gtt tat cat aac gtc gca aat etc taa 717 
Asp Asp Asp Val Gin Lys Val Tyr His Asn Val Ala Asn Leu * 
225 230 235 



<210> 206 
<211> 786 
<212> I3NA 
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<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (786) 

<400> 206 

gtg gtt ctt atg tec gaa aca aaa aac gaa ctt gaa gac ctg ctg gaa 48 
Met Val Leu Met Ser Glu Thr Lys Asn Glu Leu Glu Asp Leu Leu Glu 
15 10 15 

aaa gca gca act gaa ccg gcg cac cgc ccg gec ttt ttc cgt act eta 96 
Lys Ala Ala Thr Glu Pro Ala His Arg Pro Ala Phe Phe Arg Thr Leu 
20 25 30 

ctg gaa tec acc gtc tgg gtg cct ggt acg gcg gcg cag ggc gag get 144 
Leu Glu Ser Thr Val Trp Val Pro Gly Thr Ala Ala Gin Gly Glu Ala 
35 40 45 

gtg gtt gaa gat age gcg ctt gat tta cag cac tgg gaa aaa gaa gac 192 
Val Val Glu Asp Ser Ala Leu Asp Leu Gin His Trp Glu Lys Glu Asp 
50 55 60 

ggc acc age gtc att cct ttt ttc acc teg tta gaa gca ctt caa cag 240 
Gly Thr Ser Val lie Pro Phe Phe Thr Ser Leu Glu Ala Leu Gin Gin 
65 70 75 80 

gcg gtt gaa gac gaa cag gca ttt gtc gta atg ccc gtt cgc acg ctg 288 
Ala Val Glu Asp Glu Gin Ala Phe Val Val Met Pro Val Arg Thr Leu 
85 90 95 

ttt gag atg aca ctt ggc gaa acg etc ttc ctt aat gee aaa ctg cca 336 
Phe Glu Met Thr Leu Gly Glu Thr Leu Phe Leu Asn Ala Lys Leu Pro 
100 105 110 

acc ggt aaa gaa ttt atg ccg cgt gaa ate agt ttg ttg att ggt gaa 384 
Thr Gly Lys Glu Phe Met Pro Arg Glu He Ser Leu Leu He Gly Glu 
115 120 125 

gag gga aat ccg ctg age age cag gaa ate ctg gaa ggc ggt gaa teg 432 
Glu Gly Asn Pro Leu Ser Ser Gin Glu He Leu Glu Gly Gly Glu Ser 
130 135 140 

ctg ata tta teg gaa gtc gca gag ccg cca gca caa atg att gat tea 480 
Leu He Leu Ser Glu Val Ala Glu Pro Pro Ala Gin Met He Asp Ser 
145 150 155 160 

etc acc acc tta ttt aaa acc att aag ccg gtg aag cgt get ttt att 528 
Leu Thr Thr Leu Phe Lys Thr He Lys Pro Val Lys Arg Ala Phe He 
165 170 175 

tgt tea att aaa gag aac gaa gag gca cag cct aat tta ctt att ggc 576 
Cys Ser He Lys Glu Asn Glu Glu Ala Gin Pro Asn Leu Leu He Gly 
180 185 190 

att gaa gec gat ggt gat ate gaa gaa att att cag gcg acg gga agt 624 
He Glu Ala Asp Gly Asp He Glu Glu He He Gin Ala Thr Gly Ser 
195 200 205 

gta gcg acc gat aca tta cct ggc gat gaa cca ate gat att tgt cag 672 
Val Ala Thr Asp Thr Leu Pro Gly Asp Glu Pro He Asp He Cys Gin 
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210 215 220 

gtg aaa aaa ggg gaa aaa gga att age cac ttt att acc gaa cat att 720 
Val Lys Lys Gly Glu Lys Gly lie Ser His Phe He Thr Glu His He 
225 230 235 240 

gcg cca ttc tat gaa cgt cgc tgg ggt ggt ttt ttg cgt gac ttt aaa 768 
Ala Pro Phe Tyr Glu Arg Arg Trp Gly Gly Phe Leu Arg Asp Phe Lys 
245 250 255 

cag aat egg ata ate taa 786 
Gin Asn Arg He He * 
260 



<210> 207 
<211> 1266 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1}... (1266) 

<400> 207 

atg etc acg aaa aag aaa tgg gcg tta ttt agt eta tta aca ctg tgt 48 
Met Leu Thr Lys Lys Lys Trp Ala Leu Phe Ser Leu Leu Thr Leu Cys 
15 10 15 

ggc ggt aca att tat aaa tta ccg teg ctg aaa gat gcg ttt tat ate 96 
Gly Gly Thr He Tyr Lys Leu Pro Ser Leu Lys Asp Ala Phe Tyr He 
20 25 30 

ccg atg cag gaa tat ttc cat ttg acc aat ggt caa att ggt aat get 144 
Pro Met Gin Glu Tyr Phe His Leu Thr Asn Gly Gin He Gly Asn Ala 
35 40 45 

atg teg gta aac tea ttt gtc acc aca gtg ggc ttt ttt ctg tct att 192 
Met Ser Val Asn Ser Phe Val Thr Thr Val Gly Phe Phe Leu Ser He 
50 55 60 

tat ttt gee gat aaa eta ccg cgc aga tac acc atg tea ttc tea etc 240 
Tyr Phe Ala Asp Lys Leu Pro Arg Arg Tyr Thr Met Ser Phe Ser Leu 
65 70 75 80 

att gcg aca gga tta ctg ggt gtt tat ttg acg aca atg ccg ggg tat 288 
He Ala Thr Gly Leu Leu Gly Val Tyr Leu Thr Thr Met Pro Gly Tyr 
85 90 95 

tgg ggc ate etc ttt gtc tgg gcg eta ttt ggc gtt act tgc gac atg 336 
Trp Gly He Leu Phe Val Trp Ala Leu Phe Gly Val Thr Cys Asp Met 
100 105 110 

atg aac tgg ccg gtc ttg etc aag teg gta agt cga ttg ggc aat age 384 
Met Asn Trp Pro Val Leu Leu Lys Ser Val Ser Arg Leu Gly Asn Ser 
115 120 125 

gaa caa caa ggt egg ttg ttt ggc ttc ttc gaa aca ggg cgt ggc att 432 
Glu Gin Gin Gly Arg Leu Phe Gly Phe Phe Glu Thr Gly Arg Gly He 
130 135 140 
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gtc gat acc gtg gtg gca ttt tct gcg ttg gca gta ttt acc tgg ttt 480 
Val Asp Thr Val Val Ala Phe Ser Ala Leu Ala Val Phe Thr Trp Phe 
145 150 155 160 

ggc agt ggc tta tta ggt ttt aaa gca ggc ate tgg ttc tat tec ctt 528 
Gly Ser Gly Leu Leu Gly Phe Lys Ala Gly lie Trp Phe Tyr Ser Leu 
165 170 175 

att gtg att gec gta ggc att att att ttc ttt gtc ctg aat gac aaa 576 
lie Val lie Ala Val Gly lie lie lie Phe Phe Val Leu Asn Asp Lys 
180 185 190 

gaa gag gca ccg tec gtt gag gtg aaa aaa gaa gac gga gca teg aaa 624 
Glu Glu* Ala Pro Ser Val Glu Val Lys Lys Glu Asp Gly Ala Ser Lys 
195 200 205 

aac acc agt atg acc teg gtg ctg aaa gac aaa act ate tgg ctt ate 672 
Asn Thr Ser Met Thr Ser Val Leu Lys Asp Lys Thr lie Trp Leu He 
210 215 220 

get ttt aac gtc ttc ttc gtt tac gcg gtt tac tgt ggc ctg aca ttc 720 
Ala Phe Asn Val Phe Phe Val Tyr Ala Val Tyr Cys Gly Leu Thr Phe 
225 230 235 240 

ttc att cca ttc ctg aaa aac ate tat eta ttg ccc gtt gcg ctg gtg 768 
Phe lie Pro Phe Leu Lys Asn He Tyr Leu Leu Pro Val Ala Leu Val 
245 250 255 

ggg get tac ggc ate att aac caa tac tgt ctg aaa atg att ggt gga 816 
Gly Ala Tyr Gly He He Asn Gin Tyr Cys Leu Lys Met He Gly Gly 
260 265 270 

ccg att ggt ggc atg att tea gat aag ate ctg aaa teg cca agt aaa 864 
Pro He Gly Gly Met He Ser Asp Lys He Leu Lys Ser Pro Ser Lys 
275 280 285 

tat eta tgc tac acc ttt ate att agt acc get gcg etc gta ctg ttg 912 
Tyr Leu Cys Tyr Thr Phe He He Ser Thr Ala Ala Leu Val Leu Leu 
290 295 300 

att atg ctg ccg cac gaa agt atg ccg gtc tat tta ggg atg gca tgt 960 
He Met Leu Pro His Glu Ser Met Pro Val Tyr Leu Gly Met Ala Cys 
305 310 315 320 

acg ctg ggc ttt ggc gcg ata gtc ttt aca cag cga gee gta ttt ttt 1008 
Thr Leu Gly Phe Gly Ala He Val Phe Thr Gin Arg Ala Val Phe Phe 
325 330 335 

gca cct ate ggc gaa gca aaa att get gaa aat aaa aca ggt gcg gcg 1056 
Ala Pro He Gly Glu Ala Lys He Ala Glu Asn Lys Thr Gly Ala Ala 
340 345 350 

atg gcg ttg ggt age ttt att ggt tac get ccg gcg atg ttc tgc ttc 1104 
Met Ala Leu Gly Ser Phe He Gly Tyr Ala Pro Ala Met Phe Cys Phe 
355 360 365 

agt ctg tat ggc tac att ctg gat tta aat ccg ggg att att ggc tac 1152 
Ser Leu Tyr Gly Tyr He Leu Asp Leu Asn Pro Gly He He Gly Tyr 
370 375 380 

aaa ate gtg ttt ggc att atg gee tgc ttc gca ttc agt ggt gcg gtg 1200 
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Lys lie Val Phe Gly He Met Ala Cys Phe Ala Phe Ser Gly Ala Val 
385 390 395 400 

gtt tec gta atg ctg gtt aag cgt att age caa cgt aag aaa gag atg 1248 
Val Ser Val Met Leu Val Lys Arg He Ser Gin Arg Lys Lys Glu Met 
405 410 415 

ctg gcg get gaa get taa 1266 
Leu Ala Ala Glu Ala * 
420 



<210> 208 
<211> 2313 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (2313) 

<400> 208 

atg cct cgc ttg tta acc aaa cgc ggc tgc tgg ata acg ttg gca gee 48 
Met Pro Arg Leu Leu Thr Lys Arg Gly Cys Trp He Thr Leu Ala Ala 
15 10 15 

gcg ccc ttt etc ctt ttt ctt gca gcg tgg gga gca gat aaa etc tgg 96 
Ala Pro Phe Leu Leu Phe Leu Ala Ala Trp Gly Ala Asp Lys Leu Trp 
20 25 30 

cct eta ccg ctg cat gaa gtc aat ccc gca cga gtg gtc gtg gcg cag 144 
Pro Leu Pro Leu His Glu Val Asn Pro Ala Arg Val Val Val Ala Glu 
35 40 45 

gat ggt acg ccg etc tgg cgc ttc gee gat get gac ggc ate tgg cgt 192 
Asp Gly Thr Pro Leu Trp Arg Phe Ala Asp Ala Asp Gly He Trp Arg 
50 55 60 

tat ccg gta aca ate gaa gat gtt tct cca cgt tac ctt gaa gcg ctg 240 
Tyr Pro Val Thr He Glu Asp Val Ser Pro Arg Tyr Leu Glu Ala Leu 
65 70 75 80 

ate aat tat gaa gat cgc tgg ttc tgg aag cat ccg ggg gtg aat cca 288 
He Asn Tyr Glu Asp Arg Trp Phe Trp Lys His Pro Gly Val Asn Pro 
85 90 95 

ttc teg gtg gcg cgc gca gca tgg caa gat etc act teg gga egg gtt 336 
Phe Ser Val Ala Arg Ala Ala Trp Gin Asp Leu Thr Ser Gly Arg Val 
100 105 110 

att tec ggt ggc age acg etc act atg cag gtt get cgt ctg ctt gat 384 
He Ser Gly Gly Ser Thr Leu Thr Met Gin Val Ala Arg Leu Leu Asp 
115 120 125 

cct cac ccc aaa aca ttt ggc ggc aaa att cgc cag etc tgg cgc gcg 432 
Pro His Pro Lys Thr Phe Gly Gly Lys He Arg Gin Leu Trp Arg Ala 
130 135 140 

ttg caa ctg gaa tgg cat ctg tct aag cgt gaa att ctg acc ttg tat 480 
Leu Gin Leu Glu Trp His Leu Ser Lys Arg Glu He Leu Thr Leu Tyr 
145 150 155 160 
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ctt aac cgc get ccg ttt ggc ggt acg ttg cag ggg ate ggt gcg gca 528 
Leu Asn Arg Ala Pro Phe Gly Gly Thr lieu Gin Gly lie Gly Ala Ala 
165 170 175 

agt tgg get tat etc gga aaa teg cct gcg aat tta age tat tec gag 576 
Ser Trp Ala Tyr Leu Gly Lys Ser Pro Ala Asn Leu Ser Tyr Ser Glu 
180 185 190 

gcg gca atg ctg gcg gtt ttg ccg caa gcg ccc age cgt ctt cgc ccg 624 
Ala Ala Met Leu Ala Val Leu Pro Gin Ala Pro Ser Arg Leu Arg Pro 
195 200 205 

gat cgt tgg ccg gag cgt gee gaa gee gcg cgt aat aaa gtg etc gaa 672 
Asp Arg Trp Pro Glu Arg Ala Glu Ala Ala Arg Asn Lys Val Leu Glu 
210 215 220 

egg atg gee gtg caa ggt gtg tgg tec cgt gag cag gta aaa gag tea 720 
Arg Met Ala Val Gin Gly Val Trp Ser Arg Glu Gin Val Lys Glu Ser 
225 230 235 240 

agg gaa gaa ccc ate tgg ctg gec ccc cga caa atg ccg caa ctg gca 768 
Arg Glu Glu Pro lie Trp Leu Ala Pro Arg Gin Met Pro Gin Leu Ala 
245 250 255 

ccg ctg ttt teg cgc atg atg etc ggt aaa age aaa age gac aaa ate 816 
Pro Leu Phe Ser Arg Met Met Leu Gly Lys Ser Lys Ser Asp Lys He 
260 265 270 

act act acg ttg gat gee ggt ctt caa cga cgt ctg gaa gaa ctg gcg 864 
Thr Thr Thr Leu Asp Ala Gly Leu Gin Arg Arg Leu Glu Glu Leu Ala 
275 280 285 

caa aac tgg aaa ggg egg ttg cca ccg cgc age tea ctg gcg atg ate 912 
Gin Asn Trp Lys Gly Arg Leu Pro Pro Arg Ser Ser Leu Ala Met He 
290 295 300 

gtg gtt gat cat ace gat atg cgt gtt cgc ggc tgg gtg gga teg gtt 960 
Val Val Asp His Thr Asp Met Arg Val Arg Gly Trp Val Gly Ser Val 
305 310 315 320 

gat etc aac gat gat tea cgc ttt ggt cat gtt gat atg gtc aat teg 1008 
Asp Leu Asn Asp Asp Ser Arg Phe Gly His Val Asp Met Val Asn Ser 
325 330 335 

ate cga teg cca gga tea gtg etc aaa ccg ttt gtt tat ggt ctg gcg 1056 
He Arg Ser Pro Gly Ser Val Leu Lys Pro Phe Val Tyr Gly Leu Ala 
340 345 350 

ctg gat gaa ggc ttg ate cac ccg gca tea ctg ctg caa gac gtc ccc 1104 
Leu Asp Glu Gly Leu He His Pro Ala Ser Leu Leu Gin Asp Val Pro 
355 360 365 

egg cgc acc ggt gat tat cga cca ggt aac ttt gat age ggt ttt cat 1152 
Arg Arg Thr Gly Asp Tyr Arg Pro Gly Asn Phe Asp Ser Gly Phe His 
370 375 380 

ggc ccg ate age atg age gag gcg ctg gtg cgc teg ctg aac tta cct 1200 
Gly Pro He Ser Met Ser Glu Ala Leu Val Arg Ser Leu Asn Leu Pro 
385 390 395 400 
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get gtg cag gtg ctg gaa gec tat gga ccg aaa egg ttt gcg gca aag 1248 
Ala Val Gin Val Leu Glu Ala Tyr Gly Pro Lys Arg Phe Ala Ala Lys 
405 410 415 

tta cgc aat gtt gga ttg ccg tta tat ttg ccc aac ggt get gcg ccg 1296 
Leu Arg Asn Val Gly Leu Pro Leu Tyr Leu Pro Asn Gly Ala Ala Pro 
420 425 430 

aat ctt tea etc att etc ggc ggc get ggt gca aaa ctg gaa gat atg 1344 
Asn lieu Ser Leu lie Leu Gly Gly Ala Gly Ala Lys Leu Glu Asp Met 
435 440 445 

gcg gca gcg tat acc gcg ttt get cgc cac ggc aag gca ggc aaa ttg 1392 
Ala Ala Ala Tyr Thr Ala Phe Ala Arg His Gly Lys Ala Gly Lys Leu 
450 455 460 

cgc tta cag cct gac gat ccg ctg ctt gaa egg cct tta atg teg teg 1440 
Arg Leu Gin Pro Asp Asp Pro Leu Leu Glu Arg Pro Leu Met Ser Ser 
465 470 475 480 

ggg gcg gcg tgg atc att cgc egg att atg get gat gaa gcg caa ccc 1488 
Gly Ala Ala Trp lie lie Arg Arg lie Met Ala Asp Glu Ala Gin Pro 
485 490 495 

ttg ccg gat agt gee ttg ccg cgc gtc gee cca ctg gca tgg aaa acg 1536 
Leu Pro Asp Ser Ala Leu Pro Arg Val Ala Pro Leu Ala Trp Lys Thr 
500 505 510 

ggc acc age tat ggc tat cgt gac gee tgg gcg att ggg gtt aac get 1584 
Gly Thr Ser Tyr Gly Tyr Arg Asp Ala Trp Ala lie Gly Val Asn Ala 
515 520 525 

cgc tat gtc att ggg atc tgg act ggc aga ccg gac ggc acg ccc gtt 1632 
Arg Tyr Val He Gly He Trp Thr Gly Arg Pro Asp Gly Thr Pro Val 
530 535 540 

gtt ggt cag ttt ggc ttt gec agt gee gta cca ttg ttg aat cag gtc 1680 
Val Gly Gin Phe Gly Phe Ala Ser Ala Val Pro Leu Leu Asn Gin Val 
545 550 555 560 

aat aat atc tta ctg teg cgc agt gcg aat ctg ccg gaa gac ccg cgc 1728 
Asn Asn He Leu Leu Ser Arg Ser Ala Asn Leu Pro Glu Asp Pro Arg 
565 570 575 

ccg aac tea gtc act cgt ggc gtt atc tgc tgg ccg ggc gga cag tct 1776 
Pro Asn Ser Val Thr Arg Gly Val He Cys Trp Pro Gly Gly Gin Ser 
580 585 590 

ttg ccg gaa ggt gac ggt aac tgt cgc cgc cgc ctg gca acc tgg ctg 1824 
Leu Pro Glu Gly Asp Gly Asn Cys Arg Arg Arg Leu Ala Thr Trp Leu 
595 600 605 

ctg gac ggg agt cag ccg cca act eta tta ctg ccg gag cag gaa ggc 1B72 
Leu Asp Gly Ser Gin Pro Pro Thr Leu Leu Leu Pro Glu Gin Glu Gly 
610 615 620 

att aat ggc att cgt ttc ccc atc tgg ctg gat gaa aat ggc aaa cgt 1920 
He Asn Gly He Arg Phe Pro He Trp Leu Asp Glu Asn Gly Lys Arg 
625 630 635 640 

gtt gec get gat tgc ccg caa gcg cga caa gaa atg ata aat gtc tgg 1968 
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Val Ala Ala Asp Cys Pro Gin Ala Arg Gin Glu Met lie Asn Val Trp 
645 650 655 

ccg eta ccg ctg gaa ccc tgg ctg ccc gca tea gaa cgt cgc get gtg 2016 
Pro lieu Pro Leu Glu Pro Trp Leu Pro Ala Ser Glu Arg Arg Ala Val 
660 665 670 

cgc tta cca ccg gee teg aca agt tgc ccg cca tac ggt cac gat gee 2064 
Arg Leu Pro Pro Ala Ser Thr Ser Cys Pro Pro Tyr Gly His Asp Ala 
675 680 685 

cag etc ccg ctg cag ctg aca ggc gtt cgc gat ggc gcg att att aaa 2112 
Gin Leu Pro Leu Gin Leu Thr Gly Val Arg Asp Gly Ala lie lie Lys 
690 695 700 

cgt tta ccg ggc gca gcg gaa gcg act ttg ccg ttg caa tea agt gga 2160 
Arg Leu Pro Gly Ala Ala Glu Ala Thr Leu Pro Leu Gin Ser Ser Gly 
705 710 715 720 

ggST gca ggt gaa cgc tgg tgg ttt ctg aat ggc gaa ccg tta act gaa 2208 
Gly Ala Gly Glu Arg Trp Trp Phe Leu Asn Gly Glu Pro Leu Thr Glu 
725 730 735 

cgc ggg cgc aac gtg ace ctg cat ttg acg gat aaa ggt gat tat caa 2256 
Arg Gly Arg Asn Val Thr Leu His Leu Thr Asp Lys Gly Asp Tyr Gin 
740 745 750 

ttg ctg gtg atg gat gac gtg ggg caa ate gcg aca gtg aaa ttt gtc 2304 
Leu Leu Val Met Asp Asp Val Gly Gin He Ala Thr Val Lys Phe Val 
755 760 765 

atg caa tag 2313 
Met Gin * 
770 



<210> 209 
<211> 4962 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (4962) 

<400> 209 

atg aaa aag tta cgc gta gee gee tgc atg eta atg ctg gcg ctg gca 48 
Met Lys Lys Leu Arg Val Ala Ala Cys Met Leu Met Leu Ala Leu Ala 
15 10 15 



ggg tgc gac aac aac gat aac gcg cca aca gcg gtg aaa aaa gat gcg 96 
Gly Cys Asp Asn Asn Asp Asn Ala Pro Thr Ala Val Lys Lys Asp Ala 
20 25 30 

cct tct gaa gtt act aaa gcg gee tct tea gaa aac gcg agt tea gca 144 
Pro Ser Glu Val Thr Lys Ala Ala Ser Ser Glu Asn Ala Ser Ser Ala 
35 40 45 

aaa etc tec gtg ccg gag aga caa aaa ctg gec caa cag agt gec ggt 192 
Lys Leu Ser Val Pro Glu Arg Gin Lys Leu Ala Gin Gin Ser Ala Gly 
50 55 60 



-182- 



WO 01/48209 



PCT/US00/34419 



aag gtg ctg aca ttg ctg gat etc tct gaa gtc caa ctt gat ggt gca 240 
Lys Val Leu Thr Leu Leu Asp Leu Ser Glu Val Gin Leu Asp Gly Ala 
65 70 75 80 

gec acg ctg gtg ctg acg ttc tec ate cct etc gac ccg gat cag gat 288 
Ala Thr Leu Val Leu Thr Phe Ser lie Pro Leu Asp Pro Asp Gin Asp 
85 90 95 

ttc tea cgc gtt att cat gtc gtc gat aaa aaa age ggc aaa gtg gat 336 
Phe Ser Arg Val He His Val Val Asp Lys Lys Ser Gly Lys Val Asp 
100 105 110 

ggt gee tgg gag ctg tea gat aat ctt aaa gag tta cgt tta cgc cac 384 
Gly Ala Trp Glu Leu Ser Asp Asn Leu Lys Glu Leu Arg Leu Arg His 
115 120 125 

etc gaa ccg aaa cgt gat ttg ate gtt act att ggc aag gag gtc aaa 432 
Leu Glu Pro Lys Arg Asp Leu He Val Thr He Gly Lys Glu Val Lys 
130 135 140 

gca etc aac aac gca ace ttc agt aaa gat tac gaa aaa act ata act 480 
Ala Leu Asn Asn Ala Thr Phe Ser Lys Asp Tyr Glu Lys Thr He Thr 
145 150 155 160 

ace cgc gac ate caa ccc age gtc ggt ttt gee age cgt ggt teg ctg 528 
Thr Arg Asp He Gin Pro Ser Val Gly Phe Ala Ser Arg Gly Ser Leu 
165 170 175 

ctg cct ggc aaa gtc gtt gaa ggg ctg ccg gta atg gcg etc aac gtt 576 
Leu Pro Gly Lys Val Val Glu Gly Leu Pro Val Met Ala Leu Asn Val 
180 185 190 

aat aat gtc gat gtt aac ttc ttc cgc gtt aag cca gaa tct ctg cca 624 
Asn Asn Val Asp Val Asn Phe Phe Arg Val Ijys Pro Glu Ser Leu Pro 
195 200 205 

gca ttc att age caa tgg gaa tac cgc aat teg ctg gcg aac tgg cag 672 
Ala Phe He Ser Gin Trp Glu Tyr Arg Asn Ser Leu Ala Asn Trp Gin 
210 215 220 

tea gac aaa ctg ctg cag atg gcg gat ctg gtc tac ace gga egg ttt 720 
Ser Asp Lys Leu Leu Gin Met Ala Asp Leu Val Tyr Thr Gly Arg Phe 
225 230 235 240 

gat etc aat cct gcg cgt aac ace cgt gaa aaa tta ttg ctg ccg ctg 768 
Asp Leu Asn Pro Ala Arg Asn Thr Arg Glu Lys Leu Leu Leu Pro Leu 
245 250 255 

ggc gat ate aaa ccg ctt cag cag gcg ggc gtg tat ctg get gtg atg 816 
Gly Asp He Lys Pro Leu Gin Gin Ala Gly Val Tyr Leu Ala Val Met 
260 265 270 

aat cag get gga cgt tac gat tac agt aat ccc gcg acg ctg ttt acg 864 
Asn Gin Ala Gly Arg Tyr Asp Tyr Ser Asn Pro Ala Thr Leu Phe Thr 
275 280 285 

tta agt gat ate ggc gtt tea get cac cgt tat cac aat cgt ctg gat 912 
Leu Ser Asp He Gly Val Ser Ala His Arg Tyr His Asn Arg Leu Asp 
290 295 300 
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ate ttt acc caa agt ctg gaa aac ggc gcg gec cag caa gga att gaa 960 
lie Phe Thr Gin Ser Leu Glu Asn Gly Ala Ala Gin Gin Gly lie Glu 
305 310 315 320 

gtc tct tta tta aat gag aaa ggg cag act ctg act cag gca acc agt 1008 
Val Ser Leu Leu Asn Glu Lys Gly Gin Thr Leu Thr Gin Ala Thr Ser 
325 330 335 

gac get cag ggg cat gtg cag ctg gaa aat gat aaa aac gcg gca tta 1056 
Asp Ala Gin Gly His Val Gin Leu Glu Asn Asp Lys Asn Ala Ala lieu 
340 345 350 

ctg ttg gcg cgt aaa gac ggt cag aca acg eta etc gat tta aaa ctt 1104 
Leu Leu Ala Arg Lys Asp Gly Gin Thr Thr Leu Leu Asp Leu Lys Leu 
355 360 365 

ccg gcg ctg gac tta gca gaa ttt aac att get ggc gcg cca ggc tat 1152 
Pro Ala Leu Asp Leu Ala Glu Phe Asn lie Ala Gly Ala Pro Gly Tyr 
370 375 380 

age aaa cag ttt ttc atg ttt ggc cca cgc gat ctt tat cgc cca ggt 1200 
Ser Lys Gin Phe Phe Met Phe Gly Pro Arg Asp Leu Tyr Arg Pro Gly 
385 390 395 ~ 400 

gaa acg gta ate etc aat ggt ttg ctg cgt gat gca gac ggt aaa gcg 1248 
Glu Thr Val lie Leu Asn Gly Leu Leu Arg Asp Ala Asp Gly Lys Ala 
405 410 415 

ttg ccc aat caa ccc ate aag tta gac gtg att aaa ccc gat ggg cag 1296 
Leu Pro Asn Gin Pro lie Lys Leu Asp Val He Lys Pro Asp Gly Gin 
420 425 430 

gta etc agg age gtc gtt agt cag ccg gag aat ggc etc tac cac ttt 1344 
Val Leu Arg Ser Val Val Ser Gin Pro Glu Asn Gly Leu Tyr His Phe 
435 440 445 

acc tgg cca etc gat age aat gcg gca acc ggt atg tgg cat att cgc 1392 
Thr Trp Pro Leu Asp Ser Asn Ala Ala Thr Gly Met Trp His He Arg 
450 455 460 

get aac acg ggc gat aat cag tat egg atg tgg gat ttc cac gtc gaa 1440 
Ala Asn Thr Gly Asp Asn Gin Tyr Arg Met Trp Asp Phe His Val Glu 
465 470 475 480 

gat ttt atg cca gag cgc atg gcg ctg aat ctg acc ggt gag aaa acc 1488 
Asp Phe Met Pro Glu Arg Met Ala Leu Asn Leu Thr Gly Glu Lys Thr 
485 490 495 

ccg eta acg ccg aaa gat gaa gtg aaa ttc tec gtg gtg ggg tac tac 1536 
Pro Leu Thr Pro Lys Asp Glu Val Lys Phe Ser Val Val Gly Tyr Tyr 
500 505 510 

ctg tat ggt gca cct get aat ggt aat act ttg caa ggg caa ctt ttc 1584 
Leu Tyr Gly Ala Pro Ala Asn Gly Asn Thr Leu Gin Gly Gin Leu Phe 
515 520 525 

ctg cgc cca ctg cgt gaa get gtg tea gee tta cct ggt ttt gaa ttc 1632 
Leu Arg Pro Leu Arg Glu Ala Val Ser Ala Leu Pro Gly Phe Glu Phe 
530 535 540 

ggc gat ata get gec gaa aat ctt tec cgc acg ctg gat gaa gtt cag 1680 
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Gly Asp lie Ala Ala Glu Asn Leu Ser Arg Thr Leu Asp Glu Val Gin 
545 550 555 560 

ttg acg ctg gat gat aaa ggg cgc ggc gaa gtt tct aca gaa age cag 1728 
Leu Thr Leu Asp Asp Lys Gly Arg Gly Glu Val Ser Thr Glu Ser Gin 
565 570 575 

tgg aag gaa acg cat tec cca tta cag gtt att ttc cag ggt agt ttg 1776 
Trp Lys Glu Thr His Ser Pro Leu Gin Val He Phe Gin Gly Ser Leu 
580 585 590 

ctg gaa teg ggc ggt cgc ccg gtg acg cgc cgc get gag cag get ate 1824 
Leu Glu Ser Gly Gly Arg Pro Val Thr Arg Arg Ala Glu Gin Ala He 
595 600 605 

tgg cct gee gat gca ttg ccg ggg ate cgt ccg cag ttc gee teg aaa 1872 
Trp Pro Ala Asp Ala Leu Pro Gly He Arg Pro Gin Phe Ala Ser Lys 
610 615 620 

teg gtt tac gat tat cgt act gac age acg gtg aaa cag ccc att gtt 1920 
Ser Val Tyr Asp Tyr Arg Thr Asp Ser Thr Val Lys Gin Pro He Val 
625 630 635 640 

gat gaa ggc agt aac gee get ttt gac ate gtt tat age gat gcg caa 1968 
Asp Glu Gly Ser Asn Ala Ala Phe Asp He Val Tyr Ser Asp Ala Gin 
645 650 655 

ggc gtg aaa aaa gee gtg teg ggc ttg cag gtg cgc ctg att cgc gaa 2016 
Gly Val Lys Lys Ala Val Ser Gly Leu Gin Val Arg Leu He Arg Glu 
660 665 670 

cgc cgc gat tac tac tgg aac tgg tea gaa gat gaa ggc tgg cag tea 2064 
Arg Arg Asp Tyr Tyr Trp Asn Trp Ser Glu Asp Glu Gly Trp Gin Ser 
675 680 685 

cag ttt gat caa aaa gat ctg ate gaa aat gaa caa act ctg gat ctg 2112 
Gin Phe Asp Gin Lys Asp Leu He Glu Asn Glu Gin Thr Leu Asp Leu 
690 695 700 

aaa gcg gac gaa acc ggc aag gtc agt ttt ccg gta gag tgg ggc get 2160 
Lys Ala Asp Glu Thr Gly Lys Val Ser Phe Pro Val Glu Trp Gly Ala 
705 710 715 720 



tat cgt ctg gaa gtc aaa gcg ccg aat gaa gcg gtc agt agt gtt cgt 
Tyr Arg Leu Glu Val Lys Ala Pro Asn Glu Ala Val Ser Ser Val Arg 
725 730 735 



gtg cga ccc gac cgt gtc acg ctg aaa ctg gat aaa gec agt tat cgc 
Val Arg Pro Asp Arg val Thr Leu Lys Leu Asp Lys Ala Ser Tyr Arg 
755 760 765 



ggt tat gcg atg gtc gag tec agt gaa ggg ccg ctg tgg tgg caa gag 
Gly Tyr Ala Met Val Glu Ser Ser Glu Gly Pro Leu Trp Trp Gin Glu 



2208 



ttc tgg get ggc tat age tgg cag gac aac age gac ggt age ggc gca 2256 
Phe Trp Ala Gly Tyr Ser Trp Gin Asp Asn Ser Asp Gly Ser Gly Ala 
740 745 750 



2304 



cct ggc gac acc att aag ttg cat ate gee gcg cca acg gcg ggt aaa 2352 
Pro Gly Asp Thr He Lys Leu His He Ala Ala Pro Thr Ala Gly Lys 
770 775 780 



2400 
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785 790 795 800 

att gat gtt egg get caa ggg ctg gat ctg acg att ccg gtc gat aaa 2448 
He Asp Val Arg Ala Gin Gly Leu Asp Leu Thr He Pro Val Asp Lys 
805 810 815 

acc tgg aat cgt cat gat ctg tat tta agt acg ctg gtg gta cgt cct 2496 
Thr Trp Asn Arg His Asp Leu Tyr Leu Ser Thr Leu Val Val Arg Pro 
820 825 830 

ggc gat aaa tct cgc tec gcg acg cca aaa cgc gcg gtt ggt gtg ttg 2544 
Gly Asp Lys Ser Arg Ser Ala Thr Pro Lys Arg Ala Val Gly Val Leu 
835 840 845 

cat ctg ccg ctt ggc gat gaa aac cgt cgc etc gat ctg gcg ctg gaa 2592 
His Leu Pro Leu Gly Asp Glu Asn Arg Arg Leu Asp Leu Ala Leu Glu 
850 855 860 

aca cca gca aaa atg cgt ccc aat caa cca tta acc gtg aaa att aaa 2640 
Thr Pro Ala Lys Met Arg Pro Asn Gin Pro Leu Thr Val Lys He Lys 
865 870 875 880 * 

gec age act aaa aat ggc gag aag cct aaa cag gtg aat gtg ctg gtg 2688 
Ala Ser Thr Lys Asn Gly Glu Lys Pro Lys Gin Val Asn Val Leu Val 
885 890 895 

tct gec gtt gat agt ggt gtg ctg aat att act gac tac gtc acg cca 2736 
Ser Ala Val Asp Ser Gly Val Leu Asn He Thr Asp Tyr Val Thr Pro 
900 905 910 

gat ccg tgg cag gcg ttc ttt ggt cag aaa cgc tat ggc gca gac att 2784 
Asp Pro Trp Gin Ala Phe Phe Gly Gin Lys Arg Tyr Gly Ala Asp He 
915 920 925 

tac gat att tac ggt cag gtt att gaa ggt cag ggg cgt ctg gca get 2832 
Tyr Asp lie Tyr Gly Gin Val He Glu Gly Gin Gly Arg Leu Ala Ala 
930 935 940 

ctg cgt ttc ggt ggc gat ggt gat gag ctg aaa cgt ggt ggt aaa ccg 2880 
Leu Arg Phe Gly Gly Asp Gly Asp Glu Leu Lys Arg Gly Gly Lys Pro 
945 950 955 960 

ccg gtc aat cac gtc aat att gtc gtg cag cag gcg ctg ccg gta acg 2928 
Pro Val Asn His Val Asn He Val Val Gin Gin Ala Leu Pro Val Thr 
965 970 975 

etc aac gaa cag ggc gaa ggc teg gtt aca ctg ccg att ggc gat ttt 2976 
Leu Asn Glu Gin Gly Glu Gly Ser Val Thr Leu Pro He Gly Asp Phe 
980 985 990 

aac ggt gaa ttg cgc gtc atg gcg caa gee tgg acg gca gat gac ttc 3024 
Asn Gly Glu Leu Arg Val Met Ala Gin Ala Trp Thr Ala Asp Asp Phe 
995 1000 1005 

ggt age aac gaa agt aaa gtg ata gtt gee gca ccg gtg att get gaa 3072 
Gly Ser Asn Glu Ser Lys Val He Val Ala Ala Pro Val He Ala Glu 
1010 1015 1020 

ctg aac atg ccg cgc ttt atg gcg agt ggc gat acc teg cgt ctg acg 3120 
Leu Asn Met Pro Arg Phe Met Ala Ser Gly Asp Thr Ser Arg Leu Thr 
1025 1030 1035 1040 
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ctg gat ate act aat ctt acc gat aaa ccg caa aaa ctg aac gtt gec 3168 
Leu Asp lie Thr Asn Leu Thr Asp Lys Pro Gin Lys Leu Asn Val Ala 
1045 1050 1055 

ctg acc gec agt ggt ttg ctt gaa ctg gtc age gat tea ccc gca gec 3216 
Leu Thr Ala Ser Gly Leu Leu Glu Leu Val Ser Asp Ser Pro Ala Ala 
1060 1065 1070 

gtt gaa tta gcg cca ggt gtg cgt act acg ctg ttt ate ccg gtg cga 3264 
Val Glu Leu Ala Pro Gly Val Arg Thr Thr Leu Phe lie Pro Val Arg 
1075 1080 1085 

gca ttg ccg ggt tat ggc gat gga gaa att cag gec acc att age ggg 3312 
Ala Leu Pro Gly Tyr Gly Asp Gly Glu He Gin Ala Thr He Ser Gly 
1090 1095 1100 

tta gcg tta ccg ggt gaa acc gtt gee gat cag cat aag cag tgg aaa 3360 
Leu Ala Leu Pro Gly Glu Thr Val Ala Asp Gin His Lys Gin Trp Lys 
1105 1110 1115 1120 

ate ggc gtc cgt ccg gcg ttc ccg gca caa acg gtt aat tac ggt acg 3408 
He Gly Val Arg Pro Ala Phe Pro Ala Gin Thr Val Asn Tyr Gly Thr 
1125 1130 1135 

gcg tta cag cct ggt gag aca tgg gcg att ccg gcg gat gga ttg caa 3456 
Ala Leu Gin Pro Gly Glu Thr Trp Ala He Pro Ala Asp Gly Leu Gin 
1140 1145 1150 

aac ttc teg cct gtt acg ctg gaa ggg caa ttg ttg ttg age ggc aaa 3504 
Asn Phe Ser Pro Val Thr Leu Glu Gly Gin Leu Leu Leu Ser Gly Lys 
1155 1160 1165 

cca ccg ctg aac ate gca cgt tat ate aaa gag tta aaa gcg tat ccg 3552 
Pro Pro Leu Asn He Ala Arg Tyr He Lys Glu Leu Lys Ala Tyr Pro 
1170 1175 1180 

tac ggc tgt ctt gag caa acc gee age ggc ctg ttt ccg tea ctt tat 3600 
Tyr Gly Cys Leu Glu Gin Thr Ala Ser Gly Leu Phe Pro Ser Leu Tyr 
1185 1190 1195 1200 

acc aac gca gee caa ctg cag gcg ttg ggc ate aaa ggc gac agt gat 3648 
Thr Asn Ala Ala Gin Leu Gin Ala Leu Gly He Lys Gly Asp Ser Asp 
1205 1210 1215 

gag aaa cgc cgt gca teg gtc gat ate ggc att tec cgt ttg ctg caa 3696 
Glu Lys Arg Arg Ala Ser Val Asp He Gly He Ser Arg Leu Leu Gin 
1220 1225 1230 

atg caa cgt gat aac ggc ggc ttt gcg ctg tgg gat aaa aac ggt gac 3744 
Met Gin Arg Asp Asn Gly Gly Phe Ala Leu Trp Asp Lys Asn Gly Asp 
1235 1240 1245 

gaa gag tac tgg ctg acg get tac gtg atg gat ttc ctg gtc cgc gca 3792 
Glu Glu Tyr Trp Leu Thr Ala Tyr Val Met Asp Phe Leu Val Arg Ala 
1250 1255 1260 

ggc gaa cag ggt tac age gtg ccg aca gac gee att aac egg ggt aat 3840 
Gly Glu Gin Gly Tyr Ser Val Pro Thr Asp Ala He Asn Arg Gly Asn 
1265 1270 1275 1280 
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gag cgt ctg ctg cgc tat tta caa gat ccg ggc atg atg teg ate ccg 3888 
Glu Arg Leu Leu Arg Tyr Leu Gin Asp Pro Gly Met Met Ser lie Pro 
1285 1290 1295 

tac gcg gat aat etc aaa gee agt aaa ttc gee gta cag tct tac get 3936 
Tyr Ala Asp Asn Leu Lys Ala Ser Lys Phe Ala Val Gin Ser Tyr Ala 
1300 1305 1310 

gcg ctg gtg ttg gec cgt cag caa aag get ccg ctg ggt gcg ctg cgt 3984 
Ala Leu Val Leu Ala Arg Gin Gin Lys Ala Pro Leu Gly Ala Leu Arg 
1315 1320 1325 

gaa ate tgg gag cat cgt gca gat gec get tct ggt tta ccg ctg ctg 4032 
Glu lie Trp Glu His Arg Ala Asp Ala Ala Ser Gly Leu Pro Leu Leu 
' 1330 1335 1340 

caa ctt ggc gtt gcg ctg aaa acc atg ggt gat gcg acg cgt ggt gaa 4080 
Gin Leu Gly Val Ala Leu Lys Thr Met Gly Asp Ala Thr Arg Gly Glu 
1345 1350 1355 1360 

gaa gcg att gcg ctg gcg ctg aaa acg ccg cgt aat agt gat gag egg 4128 
Glu Ala He Ala Leu Ala Leu Lys Thr Pro Arg Asn Ser Asp Glu Arg 
1365 1370 1375 

ata tgg ctg ggt gat tac ggt agt tea ctg cgc gac aac gcg tta atg 4176 
He Trp Leu Gly Asp Tyr Gly Ser Ser Leu Arg Asp Asn Ala Leu Met 
1380 1385 1390 

etc tec ttg ctg gaa gaa aat aaa ctg eta ccc gat gag cag tac act 4224 
Leu Ser Leu Leu Glu Glu Asn Lys Leu Leu Pro Asp Glu Gin Tyr Thr 
1395 1400 1405 

ttg ctg aac aca ctt teg cag cag gcg ttt ggt gaa cgc tgg eta teg 42 72 
Leu Leu Asn Thr Leu Ser Gin Gin Ala Phe Gly Glu Arg Trp Leu Ser 
1410 1415 1420 

acg cag gaa agt aac gcg ttg ttc ctg get gec cgt acg att cag gat 4320 
Thr Gin Glu Ser Asn Ala Leu Phe Leu Ala Ala Arg Thr He Gin Asp 
1425 1430 1435 1440 

tta ccc ggt aaa tgg cag gcg caa acc tct ttc tea get gag cag ctg 4368 
lieu Pro Gly Lys Trp Gin Ala Gin Thr Ser Phe Ser Ala Glu Gin Leu 
1445 1450 1455 

aca ggc gag aaa gcg caa aac age aat ctg aat age gat caa ctt gtc 4416 
Thr Gly Glu Lys Ala Gin Asn Ser Asn Leu Asn Ser Asp Gin Leu Val 
1460 1465 1470 

acc ttg cag gtg age aac agt ggc gat cag ccg tta tgg ttg cgt atg 4464 
Thr lieu Gin Val Ser Asn Ser Gly Asp Gin Pro Leu Trp Leu Arg Met 
1475 1480 1485 

gat gec age ggt tat ccg caa tec gca cct tta ccg gcg aac aat gtg 4512 
Asp Ala Ser Gly Tyr Pro Gin Ser Ala Pro Leu Pro Ala Asn Asn Val 
1490 1495 1500 

ctg caa ate gag cgt cat att ctt ggt act gat ggt aag age aaa teg 4560 
Leu Gin He Glu Arg His He Leu Gly Thr Asp Gly Lys Ser Lys Ser 
1505 1510 , 1515 1520 

ctg gac teg tta cgt age ggc gat ctg gtg ctg gtg tgg ttg cag gta 4608 
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Leu Asp Ser Leu Arg Ser Gly Asp Leu Val Leu Val Trp Leu Gin Val 
1525 1530 1535 

aaa gcc agt aac age gtg ccg gat gcg tta gtc gtg gat ctg ctg cct 4656 
Lys Ala Ser Asn Ser Val Pro Asp Ala Leu Val Val Asp Leu Leu Pro 
1540 1545 1550 

gcg ggt ctg gaa ctg gaa aac cag aat ctg gcg aac ggt age gcc age 4704 
Ala Gly Leu Glu Leu Glu Asn Gin Asn Leu Ala Asn Gly Ser Ala Ser 
1555 1560 1565 

ctg gag caa agt ggt ggc gaa gtg cag aac tta ctg aac cag atg cag 4752 
Leu Glu Gin Ser Gly Gly Glu Val Gin Asn Leu Leu Asn Gin Met Gin 
1570 1575 1580 

cag gcg age att aag cac att gag ttc cgt gac gat cgc ttt gtg gcg 4800 
Gin Ala Ser He Lys His He Glu Phe Arg Asp Asp Arg Phe Val Ala 
1585 1590 1595 .1600 

gcg gtt gcc gtt gat gaa tac caa ccg gta acg ctg gtg tat ctg gcg 4848 
Ala Val Ala Val Asp Glu Tyr Gin Pro Val Thr Leu Val Tyr Leu Ala 
1605 1610 1615 

egg gcg gtg acg ccg gga acg tat cag gta ccg caa ccg atg gtg gaa 4896 
Arg Ala Val Thr Pro Gly Thr Tyr Gin Val Pro Gin Pro Met Val Glu 
1620 1625 1630 

tea atg tat gtt ccc caa tgg egg gcg ace ggc gcg get gaa gat ctg 4944 
Ser Met Tyr Val Pro Gin Trp Arg Ala Thr Gly Ala Ala Glu Asp Leu 
1635 1640 1645 

ctg att gtc aga ccg taa 4962 
Leu He Val Arg Pro * 
1650 



<210> 210 
<211> 795 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (795) 

<400> 210 

atg gat ctg ttg tac egg gta aaa acg 
Met Asp Leu Leu Tyr Arg Val Lys Thr 
1 5 

aat cat tac ace tgg cct gcc ate gat 
Asn His Tyr Thr Trp Pro Ala He Asp 
20 25 

cat ttt cat ctg att ggc agt att cat 
His Phe His Leu He Gly Ser He His 
35 40 

ccc ctg ccc ace cgt ttg etc aaa aag 
Pro Leu Pro Thr Arg Leu Leu Lys Lys 
50 55 



ctt tgg gcc gcg ctg cgc ggt 48 
Leu Trp Ala Ala Leu Arg Gly 
10 . 15 

ate acc etc ccc ggc aat cgc 96 
He Thr Leu Pro Gly Asn Arg 
30 

atg ggt age cac gat atg get 144 
Met Gly Ser His Asp Met Ala 
45 

etc aaa aac gcc gat gcg ctg 192 
Leu Lys Asn Ala Asp Ala Leu 
60 
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ate gtc gag gcg gat gtt tec acc age gat acg cct ttt get aat ttg 240 
lie Val Glu Ala Asp Val Ser Thr Ser Asp Thr Pro Phe Ala Asn Leu 
65 70 75 80 

cct gee tgc gag gcg ctg gaa gag cgc att age gag gag caa etc caa 288 
Pro Ala Cys Glu Ala Leu Glu Glu Arg lie Ser Glu Glu Gin Leu Gin 
85 90 95 

aac ctg cag cac att age cag gag atg ggc att tct ccc tea etc ttt 336 
Asn Leu Gin His lie Ser Gin Glu Met Gly lie Ser Pro Ser Leu Phe 
100 105 110 

tct acc caa ccg ctg tgg caa ate gcg atg gtt ctt cag gcg acg cag 384 
Ser Thr Gin Pro Leu Trp Gin lie Ala Met Val Leu Gin Ala Thr Gin 
115 120 125 

gca caa aaa ctg ggg ctg egg gca gaa tac ggt ate gat tac cag eta 432 
Ala Gin Lys Leu Gly Leu Arg Ala Glu Tyr Gly lie Asp Tyr Gin Leu 
130 135 140 

ttg cag gcg gcg aag caa caa cat aaa ccc gtg att gaa ctg gaa ggg 480 
Leu Gin Ala Ala Lys Gin Gin His Lys Pro Val lie Glu Leu Glu Gly 
145 150 155 160 

get gaa aac cag att gee atg ttg etc cag etc cct gac aaa gga ctg 528 
Ala Glu Asn Gin He Ala Met Leu Leu Gin Leu Pro Asp Lys Gly Leu 
165 170 175 

geg ctg ctg gac gat acg ctg acc cac tgg cat acc aac gca egg ttg 576 
Ala Leu Leu Asp Asp Thr Leu Thr His Trp His Thr Asn Ala Arg Leu 
180 185 190 

ctg caa caa atg atg age tgg tgg ctg aat gca ccg ccg caa aat aat 624 
Leu Gin Gin Met Met Ser Trp Trp Leu Asn Ala Pro Pro Gin Asn Asn 
195 200 205 

gat ata acg ctg ccc aat acg ttc agt cag teg ctg tac gat gtg ctg 672 
Asp lie Thr Leu Pro Asn Thr Phe Ser Gin Ser Leu Tyr Asp Val Leu 
210 215 220 

atg cat cag cga aat etc gee tgg egg gat aaa tta cgc gec atg ccg 720 
Met His Gin Arg Asn Leu Ala Trp Arg Asp Lys Leu Arg Ala Met Pro 
225 230 235 240 

ccg ggg cga tat gtg gtc gcg gtc ggt gca eta cac ctg tat gga gaa 768 
Pro Gly Arg Tyr Val Val Ala Val Gly Ala Leu His Leu Tyr Gly Glu 
245 250 255 

ggg aat ttg ccg caa atg ttg cgc taa 795 
Gly Asn Leu Pro Gin Met Leu Arg * 
260 



<210> 211 
<211> 969 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
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<222> (1) . . . (969) 
<400> 211 

atg gat aat ttt ctt get ctg acc tta acg ggt aaa aaa ccg gtt ate 48 
Met Asp Asn Phe Leu Ala Leu Thr Leu Thr Gly Lys Lys Pro Val He 
1 5 10 15 

acc gag cga gaa ate aac ggc gtt cgc tgg cgc tgg ctg ggc gat ggt 96 
Thr Glu Arg Glu He Asn Gly Val Arg Trp Arg Trp Leu Gly Asp Gly 
20 25 30 

gtg ctg gaa ctg acg cca tta acg cca ccg caa ggc gca ctg gtg att 144 
Val Leu Glu Leu Thr Pro Leu Thr Pro Pro Gin Gly Ala Leu Val He 
35 40 45 

tea gcg gga ata cac ggt aat gag acg gca cct gtg gag atg ctg gac 192 
Ser Ala Gly He His Gly Asn Glu Thr Ala Pro Val Glu Met Leu Asp 
50 55 60 

gcg ttg ctt ggc gcg ata tct cac ggc gag ate ccg tta cgt tgg egg 240 
Ala Leu Leu Gly Ala He Ser His Gly Glu He Pro Leu Arg Trp Arg 
65 70 75 80 

ttg ctg gtg ate etc ggg aat cct cct gcg ctg aag caa ggg aaa cgt 288 
Leu Leu Val. He Leu Gly Asn Pro Pro Ala Leu Lys Gin Gly Lys Arg 
85 90 95 

tat tgc cat age gat atg aat cga atg ttt ggc ggt cgt tgg cag eta 336 
Tyr Cys His Ser Asp Met Asn Arg Met Phe Gly Gly Arg Trp Gin Leu 
100 105 110 

ttt get gaa age gga gaa acc tgt egg gcg cgc gaa ctg gaa cag tgc 384 
Phe Ala Glu Ser Gly Glu Thr Cys Arg Ala Arg Glu Leu Glu Gin Cys 
115 120 125 

ctg gaa gat ttt tat gac cag ggc aaa gaa tct gtg cgc tgg cac ctt 432 
Leu Glu Asp Phe Tyr Asp Gin Gly Lys Glu Ser Val Arg Trp His Leu 
130 135 140 

gat eta cat acc gca att cgt ggc tec ttg cat ccg cag ttc ggt gta 480 
Asp Leu His Thr Ala He Arg Gly Ser Leu His Pro Gin Phe Gly Val 
145 150 155 160 

tta ccg caa cgc gac att ccc tgg gac gag aaa ttt ctg acg tgg ctg 528 
Leu Pro Gin Arg Asp He Pro Trp Asp Glu Lys Phe Leu Thr Trp Leu 
165 170 175 

ggt gcg gcg ggg ctg gag gcg ctg gtg ttc cat cag gaa cct ggt ggt 576 
Gly Ala Ala Gly Leu Glu Ala Leu Val Phe His Gin Glu Pro Gly Gly 
180 185 190 

acg ttt acc cat ttc age gec aga cat ttt ggc gcg ctg gec tgt acg 624 
Thr Phe Thr His Phe Ser Ala Arg His Phe Gly Ala Leu Ala Cys Thr 
195 200 205 

ctg gaa ctt ggc aaa gcg ttg ccc ttt ggg caa aac gat ctt cgc cag 672 
Leu Glu Leu Gly Lys Ala Leu Pro Phe Gly Gin Asn Asp Leu Arg Gin 
210 215 220 

i 

ttt gca gta act gec age gca att get gcg ctg eta tct ggt gag agt 720 
Phe Ala Val Thr Ala Ser Ala He Ala Ala Leu Leu Ser Gly Glu Ser 
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225 230 235 240 

gtc ggt ate gtg aga aca ccg ccg etc cgt tat egg gtg gtt teg caa 768 
Val Gly lie Val Arg Thr Pro Pro Leu Arg Tyr Arg Val Val Ser Gin 
245 250 255 

att act cgc cac teg ccg tec ttc gaa atg cat atg gca agt gac acg 816 
lie Thr Arg His Ser Pro Ser Phe Glu Met His Met Ala Ser Asp Thr 
260 265 270 

ctg aat ttt atg ccg ttt gag aaa gga aca ttg ctg gcg cag gac gga 864 
Leu Asn Phe Met Pro Phe Glu Lys Gly Thr Leu Leu Ala Gin Asp Gly 
275 280 285 

gag gaa cgt ttt ace gta acc cat gat gta gag tat gtg tta ttc cct 912 
Glu Glu Arg Phe Thr Val Thr His Asp Val Glu Tyr Val Leu Phe Pro 
290 295 300 

aat ccg ttg gta gcg ttg gga tta cgc gcg gga tta atg etc gaa aaa 960 
Asn Pro Leu Val Ala Leu Gly Leu Arg Ala Gly Leu Met Leu Glu Lys 
305 310 315 320 

ata age taa 969 
He Ser * 



<210> 212 
<211> 1344 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1344) 

<400> 212 

atg aac gec tgg gaa gtc aat ttc gac ggg ctg gta ggg ctg acg cat 48 

Met Asn Ala Trp Glu Val Asn Phe Asp Gly Leu Val Gly Leu Thr His 
1 5 10 " 15 

cat tac gcg ggc ctg teg ttt ggt aat gaa gee tct acc cgt cac cgt 96 
His Tyr Ala Gly Leu Ser Phe Gly Asn Glu Ala Ser Thr Arg His Arg 
20 25 30 

ttt cag gtg tct aac ccg cga ctg gcg gcg aag cag ggc tta ctg aaa 144 
Phe Gin Val Ser Asn Pro Arg Leu Ala Ala Lys Gin Gly Leu Leu Lys 
35 40 45 

atg aaa gee ctt gee gat gcg gga ttc ccc cag gee gtg ate ccg ccg 192 
Met Lys Ala Leu Ala Asp Ala Gly Phe Pro Gin Ala Val He Pro Pro 
50 55 60 

cac gag cgt ccg ttt att ccg gtg ctg cgt cag ttg gga ttc agt ggt 240 
His Glu Arg Pro Phe He Pro Val Leu Arg Gin Leu Gly Phe Ser Gly 
65 70 75 80 

age gat gag cag gta ctg gaa aaa gtt gca cgc cag gca ccg cac tgg 288 
Ser Asp Glu Gin Val Leu Glu Lys Val Ala Arg Gin Ala Pro His Trp 
85 90 95 
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ctt tec age gtc agt tec get teg cca atg tgg gta gee aat gcg gca 336 
Leu Ser Ser Val Ser Ser Ala Ser Pro Met Trp Val Ala Asn Ala Ala 
X00 105 110 

acg ate gcg cca tct gec gat acg ctg gat ggc aaa gtg cat etc ace 384 
Thr lie Ala Pro Ser Ala Asp Thr Leu Asp Gly Lys Val His Leu Thr 
115 120 125 

gtt gec aac ctg aac aat aaa ttt cac cgt teg ctg gaa gcg ccc gtc 432 
Val Ala Asn Leu Asn Asa Lys Phe His Arg Ser Leu Glu Ala Pro Val 
130 135 140 

act gaa teg ctg tta aaa gcg att ttt aac gac gaa gag aaa ttt age 480 
Thr Glu Ser Leu Leu Lys Ala lie Phe Asn Asp Glu Glu Lys Phe Ser 
145 150 155 160 

gtc cat teg gcg ttg cca cag gta gcg ttg etc ggt gat gag ggg gcg 528 
Val His Ser Ala Leu Pro Gin Val Ala Leu Leu Gly Asp Glu Gly Ala 
165 170 175 

gca aac cac aat cgt etc ggc ggt cat tac ggt gaa ccg ggt atg caa 576 
Ala Asn His Asn Arg Leu Gly Gly His Tyr Gly Glu Pro Gly Met Gin 
180 185 190 

ctt ttt gtc tac ggg cga gaa gaa ggc aat gat ace egg cct tec cgt 624 
Leu Phe Val Tyr Gly Arg Glu Glu Gly Asn Asp Thr Arg Pro Ser Arg 
195 200 205 

tat ccg gcg cga cag act cgc gaa gee age gag gcg gtg gca agg ctg 672 
Tyr Pro Ala Arg Gin Thr Arg Glu Ala Ser Glu Ala Val Ala Arg Leu 
210 215 220 

aat cag gtg aat ccc caa cag gtg att ttc gec cag caa aac ccg gac 720 
Asn Gin Val Asn Pro Gin Gin Val lie Phe Ala Gin Gin Asn Pro Asp 
225 230 235 240 

gtt ate gac cag ggc gtt ttt cat aat gac gtg att gee gtg agt aac 768 
Val He Asp Gin Gly Val Phe His Asn Asp Val He Ala Val Ser Asn 
245 250 255 

cgc cag gtg ctg ttt tgc cac caa cag gcg ttc get cgc cag tea cag 816 
Arg Gin Val Leu Phe Cys His Gin Gin Ala Phe Ala Arg Gin Ser Gin 
260 265 270 

tta ctg gca aac ctg cgt gcg egg gtc aat ggt ttt atg gcg ata gaa 864 
Leu Leu Ala Asn Leu Arg Ala Arg Val Asn Gly Phe Met Ala He Glu 
275 280 285 

gtt ccg gca act cag gtt tec gtg tct gat acg gtg tct acc tat ctg 912 
Val Pro Ala Thr Gin Val Ser Val Ser Asp Thr Val Ser Thr Tyr Leu 
290 295 300 

ttt aac age caa ctg ctg age cgc gat gat ggt tec atg atg ttg gtg 960 
Phe Asn Ser Gin Leu Leu Ser Arg Asp Asp Gly Ser Met Met Leu Val 
305 310 315 320 

ctg cct cag gag tgt egg gaa cac gec gga gta tgg ggt tat etc aat 1008 
Leu Pro Gin Glu Cys Arg Glu His Ala Gly Val Trp Gly Tyr Leu Asn 
325 330 335 

gaa etc ctt gec get gac aac ccg att age gaa eta aaa gtc ttt gat 1056 
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Glu Leu Leu Ala Ala Asp Asn Pro lie Ser Glu Leu Lys Val Phe Asp 
340 345 350 

tta cgt gaa age atg gcg aat ggc ggc ggc ccg gcg tgc ctg egg ttg 1104 
Leu Arg Glu Ser Met Ala Asn Gly Gly Gly Pro Ala Cys Leu Arg Leu 
355 360 365 

c 99 gtg gta ttg aca gaa gaa gaa cgc egg gcg gtg aat ccg gcg gtg 1152 
Arg Val Val Leu Thr Glu Glu Glu Arg Arg Ala Val Asn Pro Ala Val 
370 375 380 

atg atg aac gat acg ctg ttt aat gcg etc aat gac tgg gtg gat cgt 1200 
Met Met Asn Asp Thr Leu Phe Asn Ala Leu Asn Asp Trp Val Asp Arg 
385 390 395 400 

tac tac cgc gat cgc ctt act get gee gat ctg gee gac ccg caa ttg 1248 
Tyr Tyr Arg Asp Arg Leu Thr Ala Ala Asp Leu Ala Asp Pro Gin Leu 
405 410 415 

ctg cgc gaa ggg egg gaa gca ctg gat gta ttg age caa tta ctg aat 1296 
Leu Arg Glu Gly Arg Glu Ala Leu Asp Val Leu Ser Gin Leu Leu Asn 
420 425 430 

etc ggt teg gtt tat ccg ttc cag cgc gag gga ggg ggc aat gga taa 1344 
Leu Gly Ser Val Tyr Pro Phe Gin Arg Glu Gly Gly Gly Asn Gly * 
435 440 445 



<210> 213 
<211> 1479 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(1479) 

c400> 213 

atg act tta tgg att aac ggt gac tgg ata acg ggc cag ggc gca teg 48 
Met Thr Leu Trp lie Asn Gly Asp Trp lie Thr Gly Gin Gly Ala Ser 
15 10 15 

cgt gtg aag cgt aat ccg gta teg ggc gag gtg tta tgg caa ggc aat 96 
Arg Val Lys Arg Asn Pro Val Ser Gly Glu Val Leu Trp Gin Gly Asn 
20 25 30 

gat gec gat gee get cag gtc gag cag get tgt egg gca gee cgt gcg 144 
Asp Ala Asp Ala Ala Gin Val Glu Gin Ala Cys Arg Ala Ala Arg Ala 
35 40 45 

gcg ttt ccg cgc tgg gcg egg ctt tea ttt get gaa cgt cat gec gtt 192 
Ala Phe Pro Arg Trp Ala Arg Leu Ser Phe Ala Glu Arg His Ala Val 
50 55 60 

gtc gaa cgc ttt gee gca ctg ctg gaa age aat aaa gee gaa tta ace 240 
Val Glu Arg Phe Ala Ala Leu Leu Glu Ser Asn Lys Ala Glu Leu Thr 
65 70 75 80 

gcg att att gec aga gaa acg ggt aag ccg cgc tgg gaa gcg gca acc 288 
Ala lie He Ala Arg Glu Thr Gly Lys Pro Arg Trp Glu Ala Ala Thr 
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85 90 95 

gaa gtg acg gcg atg ate aat aaa ate gcg ata tea att aag gcg tat 336 
Glu Val Thr Ala Met He Asn Lys lie Ala He Ser He Lys Ala Tyr 
100 105 110 

cac gtt cgt ace ggc gag cag cgt agt gaa atg ccg gac ggc gcg gcg 384 
His Val Arg Thr Gly Glu Gin Arg Ser Glu Met Pro Asp Gly Ala Ala 
115 120 125 

age ctg cga cat cgc ccg cac ggc gtg ctg gcg gtg ttt ggg ccg tat 432 
Ser Leu Arg His Arg Pro His Gly Val Leu Ala Val Phe Gly Pro Tyr 
130 135 140 

aat ttc cct ggt cat ttg ccg aac gga cat ate gtt ccg gca ttg ctg 480 
Asn Phe Pro Gly His Leu Pro Asn Gly His He Val Pro Ala Leu Leu 
145 150 155 160 

gca ggt aac acc att ate ttt aaa ccc age gaa ctg aca ccg tgg agt 528 
Ala Gly Asn Thr He He Phe Lys Pro Ser Glu Leu Thr Pro Trp Ser 
165 170 175 

ggc gaa gcg gta atg cgt tta tgg cag cag get ggc ttg ccg cca ggc 576 
Gly Glu Ala Val Met Arg Leu Trp Gin Gin Ala Gly Leu Pro Pro Gly 
180 185 190 

gtg ctg aac ctg gtg cag ggc ggg cgt gaa acg ggt cag gcg ctg agt 624 
Val Leu Asn Leu Val Gin Gly Gly Arg Glu Thr Gly Gin Ala Leu Ser 
195 200 205 

gcg ctg gag gat etc gac ggt ttg ctg ttt acc ggt age gee aat aca 672 
Ala Leu Glu Asp Leu Asp Gly Leu Leu Phe Thr Gly Ser Ala Asn Thr 
210 ~ 215 220 

ggc tac cag ttg cat cgc cag etc tec ggt cag ccg gag aaa att etc 720 
Gly Tyr Gin Leu His Arg Gin Leu Ser Gly Gin Pro Glu Lys He Leu 
225 230 235 240 

gee ctt gag atg ggc ggt aat aat ccg eta att ate gat gag gtg gcg 768 
Ala Leu Glu Met Gly Gly Asn Asn Pro Leu He He Asp Glu Val Ala 
245 250 255 

gat ate gac gcg get gtc cat ctg acc att cag teg gcg ttt gtc aca 816 
Asp He Asp Ala Ala Val His Leu Thr He Gin Ser Ala Phe Val Thr 
260 265 270 

gee ggt caa cgc tgc acc tgc gec cgc cgt tta ttg ctg aaa age ggg 864 
Ala Gly Gin Arg Cys Thr Cys Ala Arg Arg Leu Leu Leu Lys Ser Gly 
275 280 285 

gcg cag ggc gat gcg ttt ctt get cgt ctg gtt gec gtc age cag cga 912 
Ala Gin Gly Asp Ala Phe Leu Ala Arg Leu Val Ala Val Ser Gin Arg 
290 295 300 

tta acg ccg ggc aac tgg gat gac gaa ccg cag ccg ttt att ggc ggg 960 
Leu Thr Pro Gly Asn Trp Asp Asp Glu Pro Gin Pro Phe He Gly Gly 
305 310 315 320 

ctg att tct gaa cag gee gca cag cag gtg gtt act gca tgg cag caa 1008 
Leu He Ser Glu Gin Ala Ala Gin Gin Val Val Thr Ala Trp Gin Gin 
325 330 335 
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ctg gaa gcg atg ggc gga cga ccc ctg ctt gcg ccg cgc tta tta caa 1056 
Leu Glu Ala Met Gly Gly Arg Pro Leu Leu Ala Pro Arg Leu Leu Gin 
340 345 350 

gca ggg aca teg ttg ctg acg ccg ggg ate att gaa atg aca ggc gtt 1104 
Ala Gly Thr Ser Leu Leu Thr Pro Gly lie lie Glu Met Thr Gly Val 
355 360 365 

get ggc gta cca gat gaa gag gtg ttc gga ccg tta ttg cgc gtc tgg 1152 
Ala Gly Val Pro Asp Glu Glu Val Phe Gly Pro Leu Leu Arg Val Trp 
370 375 380 

cgt tat gat act ttc gat gaa gcg att cga atg gcg aat aac act cgc 1200 
Arg Tyr Asp Thr Phe Asp Glu Ala lie Arg Met Ala Asn Asn Thr Arg 
385 390 395 400 

ttc gga etc tct tgc ggt ctg gtt tec ccc gag egg gaa aag ttc gat 1248 
Phe Gly Leu Ser Cys Gly Leu Val Ser Pro Glu Arg Glu Lys Phe Asp 
405 410 415 

caa ctg ttg ctg gag gcg egg gcg ggg att gtt aac tgg aac aaa ccg 1296 
Gin Leu Leu Leu Glu Ala Arg Ala Gly lie Val Asn Trp Asn Lys Pro 
420 425 430 

ctt ace ggt get gee agt acc gcg cca ttc ggc ggc att ggt get tec 1344 
Leu Thr Gly Ala Ala Ser Thr Ala Pro Phe Gly Gly lie Gly Ala Ser 
435 440 445 

ggt aac cat cgc ccc age gee tgg tat gee gca gat tac tgc gca tgg 1392 
Gly Asn His Arg Pro Ser Ala Trp Tyr Ala Ala Asp Tyr Cys Ala Trp 
450 455 460 

ccg atg gcg age ctg gag teg gac teg tta aca ttg ccc gec acg ctt 1440 
Pro Met Ala Ser Leu Glu Ser Asp Ser Leu Thr Leu Pro Ala Thr Leu 
465 470 475 480 

aac ccc ggg ctg gat ttt tec gat gag gtg gtg cga tga 1479 
Asn Pro Gly Leu Asp Phe Ser Asp Glu Val Val Arg * 
485 490 



<210> 214 
<211> 1035 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1035) 

<400> 214 

atg atg gtc ate cgt ccc gtt gag cga tea gat gtc teg gcg ctg atg 48 
Met Met Val He Arg Pro Val Glu Arg Ser Asp Val Ser Ala Leu Met 
15 10 15 

cag ctt gee age aaa acg ggc ggc ggc ctg acg teg ctt ccc gee aat 96 
Gin Leu Ala Ser Lys Thr Gly Gly Gly Leu Thr Ser Leu Pro Ala Asn 
20 25 30 

gaa gee acg ctt teg gcg cgt ate gaa agg gca ate aaa acc tgg caa 144 
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Glu Ala Thr Leu Ser Ala Arg lie Glu Arg Ala He Lys Thr Trp Gin 
35 40 45 

ggc gaa ctg ccc aaa agt gag cag ggc tat gtg ttc gtg ctg gaa gat 192 
Gly Glu Leu Pro Lys Ser Glu Gin Gly Tyr Val Phe Val Leu Glu Asp 
50 ^ 55 60 

1 age gag aca ggc acc gtg gcg ggg att tgt gec att gag gtg gcg gtt 240 
Ser Glu Thr Gly Thr Val Ala Gly He Cys Ala He Glu Val Ala Val 
65 70 75 80 

ggg ctg aac gat ccc tgg tac aac tat cgc gtc ggc acg ttg gtt cac 288 
Gly Leu Asn Asp Pro Trp Tyr Asn Tyr Arg Val Gly Thr Leu Val His 
85 90 95 

gec tea aaa gag ctg aat gtc tat aac gca ttg ccg acg ctg ttt etc 336 
Ala Ser Lys Glu Leu Asn Val Tyr Asn Ala Leu Pro Thr Leu Phe Leu 
100 105 110 

agt aac gat cac acc ggc age age gag ctg tgc acg ctg ttt etc gac 384 
Ser Asn Asp His Thr Gly Ser Ser Glu Leu Cys Thr Leu Phe Leu Asp 
115 120 125 

ccg gac tgg cgc aaa gag ggc aac ggc tat ttg ctg teg aaa teg cgc 432 
Pro Asp Trp Arg Lys Glu Gly Asn Gly Tyr Leu Leu Ser Lys Ser Arg 
130 135 140 

ttt atg ttt atg gcg get ttt cgc gac aag ttt aat gac aaa gtg gtt 480 
Phe Met Phe Met Ala Ala Phe Arg Asp Lys Phe Asn Asp Lys Val Val 
145 150 155 160 

get gaa atg cgc ggg gtg att gac gaa cac ggc tat tea ccg ttc tgg 528 
Ala Glu Met Arg Gly Val He Asp Glu His Gly Tyr Ser Pro Phe Trp 
165 170 175 

caa age etc ggt aaa cgc ttc ttt teg atg gat ttt age cgc gee gat 576 
Gin Ser Leu Gly Lys Arg Phe Phe Ser Met Asp Phe Ser Arg Ala Asp 
180 185 190 

ttt etc tgc ggc acc ggg caa aag gca ttt att gca gaa ctg atg ccg 624 
Phe Leu Cys Gly Thr Gly Gin Lys Ala Phe He Ala Glu Leu Met Pro 
195 200 205 

aaa cat ccg ate tat acc cac ttt tta tec cag gaa gec cag gac gtc 672 
Lys His Pro He Tyr Thr His Phe Leu Ser Gin Glu Ala Gin Asp Val 
210 215 220 

ate ggt cag gta cat ccg caa acc gcg cct gec cgc gcg gtg ctg gag 720 
He Gly Gin Val His Pro Gin Thr Ala Pro Ala Arg Ala Val Leu Glu 
225 230 235 240 

aaa gaa ggt ttt cgc tac cgt aac tat ate gac ate ttt gac ggt ggg 768 
Lys Glu Gly Phe Arg Tyr Arg Asn Tyr He Asp He Phe Asp Gly Gly 
245 250 255 

ccg acg ctt gag tgt gac ate gac cgc gtg cgc gee ate cgt aaa agt 816 
Pro Thr Leu Glu Cys Asp He Asp Arg Val Arg Ala He Arg Lys Ser 
260 265 270 

egg ctg gtg gaa gtg gca gaa ggg cag cct gcg cag ggc gat ttc cca 864 
Arg Leu Val Glu Val Ala Glu Gly Gin Pro Ala Gin Gly Asp Phe Pro 
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275 280 285 

gcc tgc ctg gtc gcc aat gaa aat tat cac cat ttc cgc gtg gtg ctg 912 
Ala Cys Leu Val Ala Asn Glu Asn Tyr His His Phe Arg Val Val Leu 
290 295 300 

gtg cgt acc gat ccg gca acc gag cgt ttg att tta acc gcc gca caa 960 
Val Arg Thr Asp Pro Ala Thr Glu Arg Leu He Leu Thr Ala Ala Gin 
305 310 315 320 

ctg gat gcc etc aaa tgc cac gcc ggg gat cgc gtt cgt ctg gtg cgc 1008 
Leu Asp Ala Leu Lys Cys His Ala Gly Asp Arg Val Arg Leu Val Arg 
325 330 335 

ctg tgc gca gag gag aaa aca gca tga 1035 
Leu Cys Ala Glu Glu Lys Thr Ala * 
340 



<210> 215 
<211> 1221 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1221) 

<400> 215 

atg tct cag cca att acg cgt gaa aac ttt gat gaa tgg atg ata cct 48 

Met Ser Gin Pro He Thr Arg Glu Asn Phe Asp Glu Trp Met He Pro 
15 10 15 

gtt tac get ccg gca ccc ttt ata ccg gta cgt ggc gaa ggt teg cgc 96 
Val Tyr Ala Pro Ala Pro Phe He Pro Val Arg Gly Glu Gly Ser Arg 
20 25 30 

ttg tgg gat cag cag ggg aaa gag tat ate gac ttc gcg ggt ggc att 144 
Leu Trp Asp Gin Gin Gly Lys Glu Tyr He Asp Phe Ala Gly Gly He 
35 40 45 

gcg gtg aac gcg ctg ggc cat gcg cat ccg gaa ctg cgt gaa gcg ctg 192 
Ala Val Asn Ala Leu Gly His Ala His Pro Glu Leu Arg Glu Ala Leu 
50 55 60 

aac gaa cag gcg agt aag ttc tgg cat acc ggc aac ggt tac acc aac 240 
Asn Glu Gin Ala Ser Lys Phe Trp His Thr Gly Asn Gly Tyr Thr Asn 
65 70 75 80 

gag ccg gta ctg cga ctg gcg aaa aaa ttg ate gac gcc acg ttt gcc 288 
Glu Pro Val Leu Arg Leu Ala Lys Lys Leu He Asp Ala Thr Phe Ala 
85 90 95 

gat cgc gtc ttc ttt tgt aac tec ggt gcg gaa gcc aac gaa gcg gcg 336 
Asp Arg Val Phe Phe Cys Asn Ser Gly Ala Glu Ala Asn Glu Ala Ala 
100 105 110 

eta aaa ctg gcg cgt aaa ttc get cac gac cgc tac ggc age cat aag 384 
Leu Lys Leu Ala Arg Lys Phe Ala His Asp Arg Tyr Gly Ser His Lys 
115 120 125 
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age ggc ate gtg gcg ttc aaa aat gcg ttt cat ggt cgc acg ctg ttt 432 
Ser Gly lie Val Ala Phe Lys Asn Ala Phe His Gly Arg Thr Leu Phe 
130 135 140 

act gtc agt gcg ggt ggg cag cca gec tat tea cag gat ttt gcg cca 480 
Thr Val Ser Ala Gly Gly Gin Pro Ala Tyr Ser Gin Asp Phe Ala Pro 
145 150 155 160 

ctg ccg gcg gat att cgt cat get gca tat aac gat att aac tct gec 528 
Leu Pro Ala Asp lie Arg His Ala Ala Tyr Asn Asp lie Asn Ser Ala 
165 170 175 

age gcg ctg att gac gac tct acc tgt gcg gtg att gtc gaa ccc ate 576 
Ser Ala Leu lie Asp Asp Ser Thr Cys Ala Val He Val Glu Pro He 
180 185 190 

cag ggg gaa ggc ggt gtg gtg cca gec age aac gcg ttt tta caa ggt 624 
Gin Gly Glu Gly Gly Val Val Pro Ala Ser Asn Ala Phe Leu Gin Gly 
195 200 205 

ctg cgt gaa ttg tgt aac cgc cac aat gcg ctg ttg att ttt gat gaa 672 
Leu Arg Glu Leu Cys Asn Arg His Asn Ala Leu Leu He Phe Asp Glu 
210 215 220 

gta caa acc ggc gtc ggg cgc acc ggg gaa ctg tat gec tat atg cac 720 
Val Gin Thr Gly val Gly Arg Thr Gly Glu Leu Tyr Ala Tyr Met His 
225 230 235 240 

tac ggc gtg acg cct gat ctg tta act acc gee aaa gcg ctg ggc ggc 768 
Tyr Gly Val Thr Pro Asp Leu Leu Thr Thr Ala Lys Ala Leu Gly Gly 
245 250 255 

ggt ttc ccg gtc ggt gcg ttg ttg gca acc gaa gag tgc gec cgc gtg 816 
Gly Phe Pro Val Gly Ala Leu Leu Ala Thr Glu Glu Cys Ala Arg Val 
260 265 270 

atg acc gtt ggc act cat ggc acc acc tat ggc ggt aat ccg ctg gec 864 
Met Thr Val Gly Thr His Gly Thr Thr Tyr Gly Gly Asn Pro Leu Ala 
275 280 285 

teg gcg gtg gca ggc aaa gtg ctg gag etc ate aac aca cca gag atg 912 
Ser Ala Val Ala Gly Lys Val Leu Glu Leu He Asn Thr Pro Glu Met 
290 295 300 

ctt aat ggc gtt aaa cag cgt cac gac tgg ttt gtt gag cgt ctt aat 960 
Leu Asn Gly Val Lys Gin Arg His Asp Trp Phe Val Glu Arg Leu Asn 
305 310 315 320 

act att aat cac cgc tat ggt ttg ttc agt gaa gtt cgc ggc tta ggt 1008 
Thr He Asn His Arg Tyr Gly Leu Phe Ser Glu Val Arg Gly Leu Gly 
325 330 335 

ttg ctg att ggc tgt gta ctg aat gee gat tac gec ggg caa gcg aaa 1056 
Leu Leu He Gly Cys Val Leu Asn Ala Asp Tyr Ala Gly Gin Ala Lys 
340 345 350 

cag ate tct cag gaa gcg gcg aaa gca ggc gtg atg gta ctg att gcg 1104 
Gin He Ser Gin Glu Ala Ala Lys Ala Gly Val Met Val Leu He Ala 
355 360 365 

ggt ggc aac gtg gtg cgt ttt gcg cct gcg etc aat gtc age gaa gaa 1152 



-199- 



WO 01/48209 



PCTAJS00/34419 



Gly Gly Asn Val Val Arg Phe Ala Pro Ala Leu Asn Val Ser Glu Glu 
370 375 380 

gag gtg acg acc gga ctg gat cgc ttt gca get get tgc gaa cac ttt 1200 
Glu Val Thr Thr Gly Leu Asp Arg Phe Ala Ala Ala Cys Glu His Phe 
385 390 395 400 

gtt age cga ggt tea tea tga 1221 
Val Ser Arg Gly Ser Ser * 
405 



<210> 216 
<211> 3147 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (3147) 

<400> 216 

atg aaa att etc age ctg cgc ctg aaa aac ctg aac tea tta aaa ggc 48 
Met Lys lie Leu Ser Leu Arg Leu Lys Asn Leu Asn Ser Leu Lys Gly 
15 10 15 

gaa tgg aag att gat ttc acc cgc gag ccg ttc gec age aac ggg ctg 96 
Glu Trp Lys lie Asp Phe Thr Arg Glu Pro Phe Ala Ser Asn Gly Leu 
20 25 30 

ttt get att acc ggc cca aca ggt gcg ggg aaa acc acc ctg ctg gac 144 
Phe Ala lie Thr Gly Pro Thr Gly Ala Gly Lys Thr Thr Leu Leu Asp 
35 40 45 

gee att tgt ctg gcg ctg tat cac gaa act ccg cgt etc tct aac gtt 192 
Ala He Cys Leu Ala Leu Tyr His Glu Thr Pro Arg Leu Ser Asn Val 
50 55 60 

tea caa teg caa aat gat etc atg acc cgc gat acc gee gaa tgt ctg 240 
Ser Gin Ser Gin Asn Asp Leu Met Thr Arg Asp Thr Ala Glu Cys Leu 
65 70 75 80 

gcg gag gtg gag ttt gaa gtg aaa ggt gaa gcg tac cgt gca ttc tgg 288 
Ala Glu Val Glu Phe Glu Val Lys Gly Glu Ala Tyr Arg Ala Phe Trp 
85 90 95 

age cag aat egg gcg cgt aac caa ccc gac ggt aat ttg cag gtg cca 336 
Ser Gin Asn Arg Ala Arg Asn Gin Pro Asp Gly Asn Leu Gin Val Pro 
100 105 110 

cgc gta gag ctg gcg cgc tgc gee gac ggc aaa att etc gee gac aaa 384 
Arg Val Glu Leu Ala Arg Cys Ala Asp Gly Lys He Leu Ala Asp Lys 
115 120 125 

gtg aaa gat aag ctg gaa ctg aca gcg acg tta acc ggg ctg gat tac 432 
Val Lys Asp Lys Leu Glu Leu Thr Ala Thr Leu Thr Gly Leu Asp Tyr 
130 135 140 

999 cgc ttc acc cgt teg atg ctg ctt teg cag ggg caa ttt get gee 480 
Gly Arg Phe Thr Arg Ser Met Leu Leu Ser Gin Gly Gin Phe Ala Ala 
145 150 155 160 
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ttc ctg aat gcc aaa ccc aaa gaa cgc gcg gaa ttg etc gag gag tta 528 
Phe Leu Asn Ala Lys Pro Lys Glu Arg Ala Glu Leu Leu Glu Glu Leu 
165 170 175 

acc ggc act gaa ate tac ggg caa ate teg gcg atg gtt ttt gag cag 576 
Thr Gly Thr Glu He Tyr Gly Gin He Ser Ala Met Val Phe Glu Gin 
180 185 190 

cac aaa teg gcc cgc aca gag ctg gag aag ctg caa gcg cag gcc age 624 
His Lys Ser Ala Arg Thr Glu Leu Glu Lys Leu Gin Ala Gin Ala Ser 
195 200 205 

ggc gtc acg ttg etc acg ccg gaa caa gtg caa teg ctg aca gcg agt 672 
Gly Val Thr Leu lieu Thr Pro Glu Gin Val Gin Ser Leu Thr Ala Ser 
210 215 220 

ttg cag gta ctt act gac gaa gaa aaa cag tta att acc gcg cag cag 720 
Leu Gin Val Leu Thr Asp Glu Glu Lys Gin Leu He Thr Ala Gin Gin 
225 230 235 240 

caa gaa caa caa teg eta aac tgg tta acg cgt cag gac gaa ttg cag 768 
Gin Glu Gin Gin Ser Leu Asn Trp Leu Thr Arg Gin Asp Glu Leu Gin 
245 250 255 

caa gaa gcc age cgc cgt cag cag gcc ttg caa cag gcg tta gcc gaa 816 
Gin Glu Ala Ser Arg Arg Gin Gin Ala Leu Gin Gin Ala Leu Ala Glu 
260 265 270 

gaa gaa aaa gcg caa cct caa ctg gcg gcg ctt agt ctg gca caa ccg 864 
Glu Glu Lys Ala Gin Pro Gin Leu Ala Ala Leu Ser Leu Ala Gin Pro 
275 280 285 

gca cga aat ctt cgt cca cac tgg gaa cgc ate gca gaa cac age gcg 912 
Ala Arg Asn Leu Arg Pro His Trp Glu Arg He Ala Glu His Ser Ala 
290 295 300 

gcg ctg gcg cat att cgc cag cag att gaa gaa gta aat act cgc tta 960 
Ala Leu Ala His He Arg Gin Gin He Glu Glu Val Asn Thr Arg Leu 
305 310 315 320 

cag age aca atg gcg ctt cgc gcg age att cgc cac cac gcg gcg aag 1008 
Gin Ser Thr Met Ala Leu Arg Ala Ser He Arg His His Ala Ala Lys 
325 330 335 

cag tea gca gaa tta cag cag cag caa caa age ctg aat acc tgg tta 1056 
Gin Ser Ala Glu Leu Gin Gin Gin Gin Gin Ser Leu Asn Thr Trp Leu 
340 345 350 

cag gaa cac gac cgc ttc cgt cag tgg aac aac gaa ccg gcg ggt tgg 1104 
Gin Glu His Asp Arg Phe Arg Gin Trp Asn Asn Glu Pro Ala Gly Trp 
355 360 365 

cgt gcg cag ttc tec caa caa acc age gat cgc gag cat ctg egg caa 1152 
Arg Ala Gin Phe Ser Gin Gin Thr Ser Asp Arg Glu His Leu Arg Gin 
370 375 380 

tgg cag caa cag tta acc cat get gag caa aaa ctt aat gcg ctt gcg 1200 
Trp Gin Gin Gin Leu Thr His Ala Glu Gin Lys Leu Asn Ala Leu Ala 
385 390 395 400 
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gcg ate acg ttg acg tta acc gec gat gaa gtt get acc gec ctg gcg 1248 
Ala lie Thr lieu Thr Leu Thr Ala Asp Glu Val Ala Thr Ala Leu Ala 
405 410 415 

caa cat get gag caa cgc cca ctg cgt cag cac ctg gtc gcg ctg cat 1296 
Gin His Ala Glu Gin Arg Pro Leu Arg Gin His Leu Val Ala Leu His 
420 425 430 

gga cag att gtt ccc caa caa aaa cgt ctg gcg cag tta cag gtc get 1344 
Gly Gin lie Val Pro Gin Gin Lys Arg Leu Ala Gin Leu Gin Val Ala 
435 440 445 

ate cag aat gtc acg caa gaa cag acg caa cgt aac gec gca ctt aac 1392 
lie Gin Asn Val Thr Gin Glu Gin Thr Gin Arg Asn Ala Ala Leu Asn 
450 455 460 

gaa atg cgc cag cgt tat aaa gaa aag acg cag caa ctt gee gat gtg 1440 
Glu Met Arg Gin Arg Tyr Lys Glu Lys Thr Gin Gin Leu Ala Asp Val 
465 470 475 480 

aaa acc att tgc gag cag gaa gcg cgc ate aaa acg ctg gaa get caa 1488 
Lys Thr He Cys Glu Gin Glu Ala Arg He Lys Thr Leu Glu Ala Gin 
485 490 495 

cgt gca cag tta cag gcg ggt cag cct tgc cca ctt tgt ggt tec acc 1536 
Arg Ala Gin Leu Gin Ala Gly Gin Pro Cys Pro Leu Cys Gly Ser Thr 
500 505 510 

age cac ccg gcg gtc gag gcg tat cag gcg ctg gag cct ggc gtt aat 1584 
Ser His Pro Ala Val Glu Ala Tyr Gin Ala Leu Glu Pro Gly Val Asn 
515 520 525 

cag tct cga tta ctg gcg ctg gaa aac gaa gtt aaa aag etc ggt gaa 1632 
Gin Ser Arg Leu Leu Ala Leu Glu Asn Glu Val Lys Lys Leu Gly Glu 
530 535 540 

gaa ggt gcg acg eta cgt ggg caa ctg gac gee ata aea aag cag ctt 1680 
Glu Gly Ala Thr Leu Arg Gly Gin Leu Asp Ala He Thr Lys Gin Leu 
545 550 555 560 

cag cgt gat gaa aac gaa gcg caa age etc cga caa gat gag caa gca 1728 
Gin Arg Asp Glu Asn Glu Ala Gin Ser Leu Arg Gin Asp Glu Gin Ala 
565 570 575 

ctt act caa caa tgg caa gec gtc acg gee age etc aat ate acc ttg 1776 
Leu Thr Gin Gin Trp Gin Ala Val Thr Ala Ser Leu Asn He Thr Leu 
580 585 590 

cag cca ctg gac gat att caa ccg tgg ctg gat gca caa gat gag cac 1824 
Gin Pro Leu Asp Asp He Gin Pro Trp Leu Asp Ala Gin Asp Glu His 
595 600 605 

gaa cgc cag ctg egg tta etc age caa egg cat gaa tta caa ggg cag 1872 
Glu Arg Gin Leu Arg Leu Leu Ser Gin Arg His Glu Leu Gin Gly Gin 
610 615 620 

att gee gcg cat aat cag caa att ate cag tat caa cag caa att gaa 1920 
He Ala Ala His Asn Gin Gin He He Gin Tyr Gin Gin Gin He Glu 
625 630 635 640 

caa cgc cag caa eta ctt tta acg aca ttg acg ggt tat gca ctg aca 1968 
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Gin Arg Gin Gin Leu Leu Leu Thr Thr Leu Thr Gly Tyr Ala Leu Thr 
645 650 655 

ttg cca cag gaa gat gaa gaa gag age tgg ttg gcg aca cgt cag caa 2016 
Leu Pro Gin Glu Asp Glu Glu Glu Ser Trp Leu Ala Thr Arg Gin Gin 
660 665 670 

gaa gcg cag age tgg cag caa cgc cag aac gaa tta acc gcg ctg caa 2064 
Glu Ala Gin Ser Trp Gin Gin Arg Gin Asn Glu Leu Thr Ala Leu Gin 
675 680 685 

aac cgt att cag cag ctg acg ccg att ctg gaa acg ttg ccg caa agt 2112 
Asn Arg lie Gin Gin Leu Thr Pro lie Leu Glu Thr Leu Pro Gin Ser 
690 695 700 

gat gaa etc ccg cac tgc gaa gaa act gtg gta ttg gaa aac tgg egg 2160 
Asp Glu Leu Pro His Cys Glu Glu Thr Val Val Leu Glu Asn Trp Arg 
705 710 715 720 

cag gta cat gaa caa tgt etc gca tta cac age cag cag cag acg tta 2208 
Gin Val His Glu Gin Cys Leu Ala Leu His Ser Gin Gin Gin Thr Leu 
725 730 735 

cag caa cag gat gtt ctg gcg gcg caa agt ctg caa aaa gee cag gcg 2256 
Gin Gin Gin Asp Val Leu Ala Ala Gin Ser Leu Gin Lys Ala Gin Ala 
740 745 750 

cag ttt gac acc gcg eta cag gee age gtc ttt gac gat cag cag gcg 2304 
Gin Phe Asp Thr Ala lieu Gin Ala Ser Val Phe Asp Asp Gin Gin Ala 
755 760 765 

ttc ctt gcg gcg eta atg gat gaa caa aca eta acg cag ctg gaa cag 2352 
Phe Leu Ala Ala Leu Met Asp Glu Gin Thr Leu Thr Gin Leu Glu Gin 
770 775 780 

etc aag cag aat ctg gaa aac cag cgc cgt cag gcg caa act ctg gtc 2400; 
Leu Lys Gin Asn Leu Glu Asn Gin Arg Arg Gin Ala Gin Thr Leu Val 
785 790 795 800 

act cag aca gca gaa acg ctg gca cag cat caa caa cac cga cct gac 2448 
Thr Gin Thr Ala Glu Thr Leu Ala Gin His Gin Gin His Arg Pro Asp 
805 810 815 

gac ggg ttg get etc act gtg acg gtg gag cag att cag caa gag tta 2496 
Asp Gly Leu Ala Leu Thr Val Thr Val Glu Gin He Gin Gin Glu Leu 
820 825 830 

gcg caa act cac caa aag ttg cgt gaa aac acc acg agt caa ggc gag 2544 
Ala Gin Thr His Gin Lys Leu Arg Glu Asn Thr Thr Ser Gin Gly Glu 
835 840 845 

att cgc cag cag ctg aag cag gat gca gat aac cgt cag caa caa caa 2592 
He Arg Gin Gin Leu Lys Gin Asp Ala Asp Asn Arg Gin Gin Gin Gin 
850 855 860 

acc tta atg cag caa att get caa atg acg cag cag gtt gag gac tgg 2640 
Thr Leu Met Gin Gin He Ala Gin Met Thr Gin Gin Val Glu Asp Trp 
865 870 875 880 

gga tat ctg aat teg eta ata ggt tec aaa gag ggc gat aaa ttc cgc 2688 
Gly Tyr Leu Asn Ser Leu He Gly Ser Lys Glu Gly Asp Lys Phe Arg 
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885 890 895 

aag ttt gcc cag ggg ctg acg ctg gat aat tta gtc cat etc get aat 2736 
Lys Phe Ala Gin Gly Leu Thr Leu Asp Asn Leu Val His Leu Ala Asn 
900 905 910 

cag caa ctt acc egg ctg cac ggg cgc tat ctg tta cag cgc aaa gcc 2784 
Gin Gin Leu Thr Arg Leu His Gly Arg Tyr Leu Leu Gin Arg Lys Ala 
915 920 * 925 

age gag gcg ctg gaa gtc gag gtt gtt gat acc tgg cag gca gat gcg 2832 
Ser Glu Ala Leu Glu Val Glu Val Val Asp Thr Trp Gin Ala Asp Ala 
930 935 940 

gta cgc gat acc cgt acc ctt tec ggc ggc gaa agt ttc etc gtt agt 2880 
Val Arg Asp Thr Arg Thr Leu Ser Gly Gly Glu Ser Phe Leu Val Ser 
945 950 955 960 

ctg gcg ctg gcg ctg gcg ctt teg gat ctg gtc age cat aaa aca cgt 2928 
Leu Ala Leu Ala Leu Ala Leu Ser Asp Leu Val Ser His Lys Thr Arg 
965 970 975 

att gac teg ctg ttc ctt gat gaa ggt ttt ggc acg ctg gat age gaa 2976 
lie Asp Ser Leu Phe Leu Asp Glu Gly Phe Gly Thr Leu Asp Ser Glu 
980 985 990 

acg ctg gat acc gcc ctt gat gcg ctg gat gcc ctg aac gcc agt ggc 3024 
Thr Leu Asp Thr Ala Leu Asp Ala Leu Asp Ala Leu Asn Ala Ser Gly 
995 1000 1005 

aaa acc ate ggt gtg att age cac gta gaa gcg atg aaa gag cgt att 3072 
Lys Thr lie Gly Val. lie Ser His Val Glu Ala Met Lys Glu Arg He 
1010 1015 1020 

ccg gtg cag ate aaa gtg aaa aag ate aac ggc ctg ggc tac age aaa 3120 
Pro Val Gin lie Lys Val Lys Lys He Asn Gly Leu Gly Tyr Ser Lys 
1025 1030 1035 1040 

ctg gaa agt acg ttt gca gtg aaa taa 3147 
Leu Glu Ser Thr Phe Ala Val Lys * 
1045 



<210> 217 
<211> 1203 
<212> DHA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1203) 

<400> 217 

atg cgc ate ctt cac acc tea gac tgg cat etc ggc cag aac ttc tac 48 
Met Arg He Leu His Thr Ser Asp Trp His Leu Gly Gin Asn Phe Tyr 
1 5 10 15 

agt aaa age cgc gaa get gaa cat cag get ttt ctt gac tgg ctg ctg 96 
Ser Lys Ser Arg Glu Ala Glu His Gin Ala Phe Leu Asp Trp Leu Leu 
20 25 30 
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gag aca gca caa acc cat cag gtg gat gcg att att gtt gcc ggt gat 144 
Glu Thr Ala Gin Thr His Gin Val Asp Ala He He Val Ala Gly Asp 
35 40 45 

gtt ttc gat acc ggc teg ccg ccc agt tac gcc cgc acg tta tac aac 192 
Val Phe Asp Thr Gly Ser Pro Pro Ser Tyr Ala Arg Thr Leu Tyr Asn 
50 55 60 

cgt ttt gtt gtc aat tta cag caa act ggc tgt cat ctg gtg gta ctg. 240 
Arg Phe Val Val Asn Leu Gin Gin Thr Gly Cys His Leu Val Val Leu 
65 70 75 80 

I 

gca gga aac cat gac teg gtc gcc acg ctg aat gaa teg cgc gat ate 288 
Ala Gly Asn His Asp Ser Val Ala Thr Leu Asn Glu Ser Arg Asp He 
85 90 95 

atg gcg ttc etc aat act acc gtg gtc gcc age gcc gga cat gcg ccg 336 
Met Ala Phe Leu Asn Thr Thr Val Val Ala Ser Ala Gly His Ala Pro 
100 105 110 

caa ate ttg cct cgt cgc gac ggg acg cca ggc gca gtg ctg tgc ccc 384 
Gin He Leu Pro Arg Arg Asp Gly Thr Pro Gly Ala Val Leu Cys Pro 
115 120 125 

att ccg ttt tta cgt ccg cgt gac att att acc age cag gcg ggg ctt 432 
He Pro Phe Leu Arg Pro Arg Asp He He Thr Ser Gin Ala Gly Leu 
130 135 140 

aac ggt att gaa aaa cag cag cat tta ctg gca gcg att acc gat tat 480 
Asn Gly He Glu Lys Gin Gin His Leu Leu Ala Ala He Thr Asp Tyr 
145 150 155 160 

tac caa caa cac tat gcc gat gcc tgc aaa ctg cgc ggc gat cag cct 52 B 
Tyr Gin Gin His Tyr Ala Asp Ala Cys Lys Leu Arg Gly Asp Gin Pro 
165 170 175 

ctg ccc ate ate gcc acg gga cat tta acg acc gtg ggg gcc agt aaa 576 
Leu Pro He He Ala Thr Gly His Leu Thr Thr Val Gly Ala Ser Lys 
180 185 190 

agt gac gcc gtg cgt gac att tat att ggc acg ctg gac gcg ttt ccg 624 
Ser Asp Ala Val Arg Asp He Tyr He Gly Thr Leu Asp Ala Phe Pro 
195 200 205 

gca caa aac ttt cca cca gcc gac tac ate gcg etc ggg cat att cac 672 
Ala Gin Asn Phe Pro Pro Ala Asp Tyr He Ala Leu Gly His He His 
210 215 220 

cgc gca cag att att ggc ggc atg gaa cat gtt cgc tat tgc ggc tec 720 
Arg Ala Gin He He Gly Gly Met Glu His Val Arg Tyr Cys Gly Ser 
225 230 235 240 

ccc att cca ctg agt ttt gat gaa tgc ggt aag agt aaa tat gtc cat 768 
Pro He Pro Leu Ser Phe Asp Glu Cys Gly Lys Ser Lys Tyr Val His 
245 250 255 

ctg gtg aca ttt tea aac ggc aaa tta gag age gtg gaa aac ctg aac 816 
Leu Val Thr Phe Ser Asn Gly Lys Leu Glu Ser Val Glu Asn Leu Asn 
260 265 270 

gta ccg gta acg caa ccc atg gca gtg ctg aaa ggc gat ctg gcg teg 864 
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Val Pro Val Thr Gin Pro Met Ala Val Leu Lys Gly Asp Leu Ala Ser 
275 280 285 

att acc gca cag ctg gaa cag tgg cgc gat gta teg cag gag cca cct 912 
lie Thr Ala Gin Leu Glu Gin Trp Arg Asp Val Ser Gin Glu Pro Pro 
290 295 300 

gtc tgg ctg gat ate gaa ate act act gat gag tat ctg cat gat att 960 
Val Trp Leu Asp He Glu He Thr Thr Asp Glu Tyr Leu His Asp He 
305 310 315 320 

cag cgc aaa ate cag gca tta acc gaa tea ttg cct gtc gaa gta ttg 1008 
Gin Arg Lys He Gin Ala Leu Thr Glu Ser Leu Pro Val Glu Val Leu 
325 330 335 

ctg gta cgt egg agt cgt gaa cag cgc gag cgt gtg tta gee age caa 1056 
Leu Val Arg Arg Ser Arg Glu Gin Arg Glu Arg Val Leu Ala Ser Gin 
340 345 350 

cag cgt gaa acc etc age gaa etc age gtc gaa gag gtg ttc aat cgc 1104 
Gin Arg Glu Thr Leu Ser Glu Leu Ser Val Glu Glu Val Phe Asn Arg 
355 360 365 

cgt ctg gca ctg gaa gaa ctg gat gaa teg cag cag caa cgt ctg cag 1152 
Arg Leu Ala Leu Glu Glu Leu Asp Glu Ser Gin Gin Gin Arg Leu Gin 
370 375 380 

cat ctt ttc acc acg acg ttg cat acc etc gee gga gaa cac gaa gca 1200 
His Leu Phe Thr Thr Thr Leu His Thr Leu Ala Gly Glu His Glu Ala 
385 390 395 400 

tga 1203 
* 



<210> 218 
<211> 765 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(765) 

<400> 218 

atg aac gat eta ttt tea ctg gca gga aaa aat ate ttg att acc ggt 48 

Met Asn Asp lieu Phe Ser Leu Ala Gly Lys Asn He Leu He Thr Gly 

1 5 10 15 

tea gca cag ggc att ggc ttt tta ctg gca acc ggc ctg ggt aaa tat 96 
Ser Ala Gin Gly He Gly Phe Leu Leu Ala Thr Gly Leu Gly Lys Tyr 
20 25 30 

ggc gca caa ata att att aat gat att act gec gaa cgc gca gaa ctt 144 
Gly Ala Gin He He He Asn Asp He Thr Ala Glu Arg Ala Glu Leu 
35 40 45 

get gta gaa aaa etc cac cag gag ggt att cag gec gtt gec gca cct 192 
Ala Val Glu Lys Leu His Gin Glu Gly He Gin Ala Val Ala Ala Pro 
50 55 60 
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ttt aat gtt act cat aaa cat gaa att gat gcc gcc gtt gaa cat ate 240 
Phe Asn Val Thr His Lys His Glu lie Asp Ala Ala Val Glu His lie 
65 70 75 80 

gaa aag gac ate ggc ccc att gat gtg ctg gtg aat aac gcc ggt ate 288 
Glu Lys Asp lie Gly Pro He Asp Val Leu Val Asn Asn Ala Gly He 
85 90 95 

cag cgc cgt cat cct ttt act gag ttc cct gaa caa gag tgg aat gat 336 
Gin Arg Arg His Pro Phe Thr Glu Phe Pro Glu Gin Glu Trp Asn Asp 
100 105 110 

gtg ate gca gta aac cag acc gcc gtg ttc ctg gta teg caa gcg gta 384 
Val He Ala Val Asn Gin Thr Ala Val Phe Leu Val Ser Gin Ala Val 
115 120 125 

act cgt cac atg gtt gaa cgc aag gca ggt aaa gtt att aat att tgc 432 
Thr Arg His Met Val Glu Arg Lys Ala Gly Lys Val He Asn He Cys 
130 135 140 

teg atg caa age gaa ctg gga cgt gac acc ate acc cct tat gcc gca 480 
Ser Met Gin Ser Glu Leu Gly Arg Asp Thr He Thr Pro Tyr Ala Ala 
145 150 155 160 

teg aaa ggg gcg gta aaa atg etc acc cgc ggc atg tgc gtc gag ctg 528 
Ser Lys Gly Ala Val Lys Met Leu Thr Arg Gly Met Cys Val Glu Leu 
165 170 175 

gcg cgc cac aat att cag gtc aac ggt att gcg ccg ggc tat ttc aaa 576 
Ala Arg His Asn He Gin Val Asn Gly He Ala Pro Gly Tyr Phe Lys 
180 185 190 

aca gaa atg act aaa gca ctg gtt gag gac gaa gcc ttc acc gcc tgg 624 
Thr Glu Met Thr Lys Ala Leu Val Glu Asp Glu Ala Phe Thr Ala Trp 
195 200 205 

ttg tgc aaa egg acc ccc gcc gca cgc tgg gga gat ccg cag gaa ctg 672 
Leu Cys Lys Arg Thr Pro Ala Ala Arg Trp Gly Asp Pro Gin Glu Leu 
210 215 220 

att ggt get gcg gtg ttc ctt tct tea aaa gcc tct gat ttc gta aac 720 
He Gly Ala Ala Val Phe Leu Ser Ser Lys Ala Ser Asp Phe Val Asn 
225 . 230 235 240 

ggc cac etg ttg ttt gtt gat ggc ggc atg tta gtg get gtt taa 765 
Gly His Leu Leu Phe Val Asp Gly Gly Met Leu Val Ala Val * 
245 250 



<210> 219 
<211> 1032 
<212> DMA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1032) 

<400> 219 
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atg caa gtg aaa aca cag tec tgc gtt gtt gcg ggc aag aaa act gtt 48 
Met Gin Val Lys Thr Gin Ser Cys Val Val Ala Gly Lys Lys Thr Val 
1 5 10 15 

gec gtt acc gag cag acg ata gat tgg aat aat aat gga aca tta gta 96 
Ala Val Thr Glu Gin Thr lie Asp Trp Asn Asn Asn Gly Thr Leu Val 
20 25 30 

caa ata acc cga ggt gga att tgc ggt tec gat tta cat tat tat cag 144 
Gin lie Thr Arg Gly Gly lie Cys Gly Ser Asp Leu His Tyr Tyr Gin 
35 40 45 

gaa gga aaa gta ggt aat ttc atg ata aag gca ccg atg gtg tta ggt 192 
Glu Gly Lys Val Gly Asn Phe Met He Lys Ala Pro Met Val Leu Gly 
50 55 60 

cat gaa gtt ate ggt aaa gtt att cat age gac tea tea gaa tta cat 240 
His Glu Val He Gly Lys Val He His Ser Asp Ser Ser Glu Leu His 
65 70 75 80 

gaa ggg caa acg gta gee att aat ccg tct aaa ccg tgc ggt cac tgc 288 
Glu Gly Gin Thr Val Ala He Asn Pro Ser Lys Pro Cys Gly His Cys 
85 90 95 

aaa tac tgc att gaa cat aac gag aat cag tgt aca gat atg cgt ttt 336 
Lys Tyr Cys He Glu His Asn Glu Asn Gin Cys Thr Asp Met Arg Phe 
100 105 110 

ttt ggc agt gec atg tat ttc cct cat gtt gat ggt ggt ttt acc cgt 384 
Phe Gly Ser Ala Met Tyr Phe Pro His Val Asp Gly Gly Phe Thr Arg 
115 120 125 

tat aaa atg gtc gaa acg teg caa tgt gtc cct tat ccg gec aaa get 432 
Tyr Lys Met Val Glu Thr Ser Gin Cys Val Pro Tyr Pro Ala Lys Ala 
130 135 140 

gac gaa aag gtt atg get ttt gee gaa cct tta gec gtc gcg att cat 480 
Asp Glu Lys Val Met Ala Phe Ala Glu Pro Leu Ala Val Ala He His 
145 150 155 160 

gee gca cat cag gee ggc gag tta cag ggc aag cga gta ttt att tec 528 
Ala Ala His Gin Ala Gly Glu Leu Gin Gly Lys Arg Val Phe He Ser 
165 170 175 

ggt gtt gga ccc att ggc tgc ctg att gtc agt gca gtg aaa aca ctg 576 
Gly Val Gly Pro He Gly Cys Leu He Val Ser Ala Val Lys Thr Leu 
180 185 190 

ggg gec gcg gaa att gtc tgt get gat gtg agt ccc cgt tec ctt teg 624 
Gly Ala Ala Glu He Val Cys Ala Asp Val Ser Pro Arg Ser Leu Ser 
195 200 205 

ctg ggc aaa gag atg ggg gcg gat gtg etc gta aac cca caa aac gac 672 
lieu Gly Lys Glu Met Gly Ala Asp Val Leu Val Asn Pro Gin Asn Asp 
210 215 220 

gac atg gat cac tgg aaa gcg gaa aaa ggc tat ttc gat gtc age ttt 720 
Asp Met Asp His Trp Lys Ala Glu Lys Gly Tyr Phe Asp Val Ser Phe 
225 230 235 240 

gaa gtg tec ggt cat cct tea tea gtg aat acc tgt ctg gag gtc act 768 
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Glu Val Ser Gly His Pro Ser Ser Val Asn Thr Cys Leu Glu Val Thr 
245 250 255 

cgt gca cgc ggc gta atg gtg cag gta ggt atg gga ggc gcg atg gca 816 
Arg Ala Arg Gly Val Met Val Gin Val Gly Met Gly Gly Ala Met Ala 
260 265 270 

gaa ttc cca atg atg acg ttg att ggt aag gag att tea etc aga ggc 864 
Glu Phe Pro Met Met Thr Leu lie Gly Lys Glu lie Ser Leu Arg Gly 
275 280 285 

tct ttc cgt ttt acc age gaa ttt aat ace gca gtg tea tgg ctg gcg 912 
Ser Phe Arg Phe Thr Ser Glu Phe Asn Thr Ala Val Ser Trp Leu Ala 
290 295 300 

aat ggc gtt ate aat cca ctg cct tta ctg agt get gaa tat ccc ttc 960 
Asn Gly Val He Asn Pro Leu Pro Leu Leu Ser Ala Glu Tyr Pro Phe 
305 310 315 320 

act gac ctg gaa gag gcg eta cgt ttc gee ggt gat aaa acc cag gca 1008 
Thr Asp Leu Glu Glu Ala Leu Arg Phe Ala Gly Asp Lys Thr Gin Ala 
325 330 335 

gca aaa gtc cag ctt gtt ttc taa 1032 
Ala Lys Val Gin Leu Val Phe * 
340 



<210> 220 
<211> 1647 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1647) 

<400> 220 

atg gat teg caa cgc aat ctt tta gtc ate get ttg ctg ttc gtg tct 48 
Met Asp Ser Gin Arg Asn Leu Leu Val He Ala Leu Leu Phe Val Ser 
15 10 15 

ttc atg ate tgg caa gee tgg gag cag gat aaa aac ccg caa cct cag 96 
Phe Met He Trp Gin Ala Trp Glu Gin Asp Lys Asn Pro Gin Pro Gin 
20 25 30 

gee caa cag acc acg cag aca acg acc acc gca gcg ggt age gee gee 144 
Ala Gin Gin Thr Thr Gin Thr Thr Thr Thr Ala Ala Gly Ser Ala Ala 
35 40 45 

gac cag ggc gta ccg gee agt ggc cag ggg aaa ctg ate teg gtt aag 192 
Asp Gin Gly Val Pro Ala Ser Gly Gin Gly Lys Leu He Ser Val Lys 
50 55 60 

acc gac gtg ctt gat ctg acc ate aac acc cgt ggt ggt gat gtt gag 240 
Thr Asp Val Leu Asp Leu Thr He Asn Thr Arg Gly Gly Asp Val Glu 
65 70 75 80 

caa get ctg ctg cct get tac ccg aaa gag ctg aac tct acc cag ccg 288 
Gin Ala Leu Leu Pro Ala Tyr Pro Lys Glu Leu Asn Ser Thr Gin Pro 
85 90 95 
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ttc cag ctg ttg gaa act tea ccg cag ttt att tat cag gca cag age 336 
Phe Gin Leu Leu Glu Thr Ser Pro Gin Phe lie Tyr Gin Ala Gin Ser 
100 105 110 

ggt ctg acc ggt cgt gat ggc ccg gat aac ccg get aac ggc ccg cgt 384 
Gly Leu Thr Gly Arg Asp Gly Pro Asp Asn Pro Ala Asn Gly Pro Arg 
115 120 125 

ccg ctg tat aac gtt gaa aaa gac get tat gtg ctg get gaa ggt caa 432 
Pro Leu Tyr Asn Val Glu Lys Asp Ala Tyr Val Leu Ala Glu Gly Gin 
130 135 140 

aac gaa ctg cag gtg ccg atg acg tat acc gac gcg gca ggc aac acg 480 
Asn Glu Leu Gin Val Pro Met Thr Tyr Thr Asp Ala Ala Gly Asn Thr 
145 150 155 160 

ttt acc aaa acg ttt gtc ctg aaa cgt ggt gat tac get gtc aac gtc 528 
Phe Thr Lys Thr Phe Val Leu Lys Arg Gly Asp Tyr Ala Val Asn Val 
165 170 175 

aac tac aac gtg cag aac get ggc gag aaa ccg ctg gaa ate tec teg 576 
Asn Tyr Asn Val Gin Asn Ala Gly Glu Lys Pro Leu Glu lie Ser Ser 
180 185 190 

ttt ggt cag ttg aag caa tec ate act ctg cca ccg cat etc gat acc 624 
Phe Gly Gin Leu Lys Gin Ser lie Thr Leu Pro Pro His Leu Asp Thr 
195 200 205 

gga age age aac ttc gca ctg cac acc ttc cgt ggc gcg gcg tac tec 672 
Gly Ser Ser Asn Phe Ala Leu His Thr Phe Arg Gly Ala Ala Tyr Ser 
210 215 220 

acg cct gac gag aag tat gag aaa tac aag ttc gat acc att gec gat 720 
Thr Pro Asp Glu Lys Tyr Glu Lys Tyr Lys Phe Asp Thr He Ala Asp 
225 230 235 240 

aac gaa aac ctg aac ate tct teg aaa ggt ggt tgg gtg gcg atg ctg 768 
Asn Glu Asn Leu Asn He Ser Ser Lys Gly Gly Trp Val Ala Met Leu 
245 250 255 

caa cag tat ttc gcg acg gcg tgg ate ccg cat aac gac ggt acc aac 816 
Gin Gin Tyr Phe Ala Thr Ala Trp He Pro His Asn Asp Gly Thr Asn 
260 265 270 

aac ttc tat acc get aat ctg ggt aac ggc ate gec get ate ggc tat 864 
Asn Phe Tyr Thr Ala Asn Leu Gly Asn Gly He Ala Ala He Gly Tyr 
275 280 285 

aaa tct cag ccg gta ctg gtt cag cct ggt cag act ggc gcg atg aac 912 
Lys Ser Gin Pro Val Leu Val Gin Pro Gly Gin Thr Gly Ala Met Asn 
290 295 300 

age acc ctg tgg gtt ggc ccg gaa ate cag gac aaa atg gca get gtt 960 
Ser Thr Leu Trp Val Gly Pro Glu He Gin Asp Lys Met Ala Ala Val 
305 310 315 320 

get ccg cac ctg gat ctg acc gtt gat tac ggt tgg ttg tgg ttc ate 1008 
Ala Pro His Leu Asp Leu Thr Val Asp Tyr Gly Trp Leu Trp Phe He 
325 330 335 
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tct cag ccg ctg ttc aaa ctg ctg aaa tgg ate cat age ttt gtg ggt 1056 
Ser Gin Pro Leu Phe Lys Leu Leu Lys Txp lie His Ser Phe Val Gly 
340 345 350 

aac tgg ggc ttc tec att ate ate ate ace ttt ate gtt cgt ggc ate 1104 
Asn Trp Gly Phe Ser lie He He He Thr Phe He Val Arg Gly He 
355 360 365 

atg tac ccg ctg acc aaa gcg cag tac acc tec atg gcg aag atg cgt 1152 
Met Tyr Pro Leu Thr Lys Ala Gin Tyr Thr Ser Met Ala Lys Met Arg 
370 375 380 

atg ttg cag ccg aag att cag gca atg cgt gag cgt ctg ggc gat gac 1200 
Met Leu Gin Pro Lys He Gin Ala Met Arg Glu Arg Leu Gly Asp Asp 
385 390 395 400 

aaa cag cgt ate age cag gaa atg atg gcg ctg tac aaa get gag aag 1248 
Lys Gin Arg He Ser Gin Glu Met Met Ala Leu Tyr Lys Ala Glu Lys 
405 410 415 

gtt aac ccg ctg ggc ggc tgc ttc ccg ctg ctg ate cag atg cca ate 1296 
Val Asn Pro Leu Gly Gly Cys Phe Pro Leu Leu He Gin Met Pro He 
420 425 430 

ttc ctg gcg ttg tac tac atg ctg atg ggt tec gtt gaa ctg cgt cag 1344 
Phe Leu Ala Leu Tyr Tyr Met Leu Met Gly Ser Val Glu Leu Arg Gin 
435 440 ' 445 

gca ccg ttt gca ctg tgg ate cac gac ctg teg gca cag gac ccg tac 1392 
Ala Pro Phe Ala Leu Trp He His Asp Leu Ser Ala Gin Asp Pro Tyr 
450 455 460 

tac ate ctg ccg ate ctg atg ggc gta acg atg ttc ttc att cag aag 1440 
Tyr He Leu Pro He Leu Met Gly Val Thr Met Phe Phe He Gin Lys 
465 470 475 480 

atg teg ccg acc aca gtg acc gac ccg atg cag cag aag ate atg acc 1488 
Met Ser Pro Thr Thr Val Thr Asp Pro Met Gin Gin Lys He Met Thr 
485 490 495 

ttt atg ccg gtc ate ttc acc gtg ttc ttc ctg tgg ttc ccg tea ggt 1536 
Phe Met Pro Val lie Phe Thr Val Phe Phe Leu Trp Phe Pro Ser Gly 
500 505 510 

ctg gtg ctg tac tat ate gtc age aac ctg gta acc att att cag cag 1584 
Leu Val Leu Tyr Tyr He Val Ser Asn Leu Val Thr He He Gin Gin 

515 . 520. 525 

cag ctg att tac cgt ggt ctg gaa aaa cgt ggc ctg cat age cgc gag 1632 
Gin Leu He Tyr Arg Gly Leu Glu Lys Arg Gly Leu His Ser Arg Glu 
530 535 . 540 

aag aaa aaa tec tga 1647 

Lys Lys Lys Ser * 

545 



<210> 221 
<211> 1629 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> CDS 

<222> (1) . . . (1629) 

<400> 221 

atg aaa act cgc gac teg caa tea agt gac gtg att ate att ggc ggc 48 
Met Lys Thr Arg Asp Ser Gin Ser Ser Asp Val lie lie lie Gly Gly 
15 10 15 

ggc gca acg gga gee ggg att gee cgc gac tgt. gee ctg cgc ggg ctg 96 
Gly Ala Thr Gly Ala Gly lie Ala Arg Asp Cys Ala Leu Arg Gly Leu 
20 25 30 

cgc gtg att ttg gtt gag cgc cac gac ate gca ace ggt gee ace ggg 144 
Arg Val lie Leu Val Glu Arg His Asp He Ala Thr Gly Ala Thr Gly 
35 40 45 

cgt aac cac ggc ctg ctg cac age ggt gcg cgc tat gcg gta ace gat 192 
Arg Asn His Gly Leu Leu His Ser Gly Ala Arg Tyr Ala Val Thr Asp 
50 55 60 

gcg gaa teg gee cgc gaa tgc att agt gaa aac cag ate ctg aaa cgc 240 
Ala Glu Ser Ala Arg Glu Cys He Ser Glu Asn Gin He Leu Lys Arg 
65 70 75 80 

att gca cgt cac tgc gtt gaa cca ace aac ggc ctg ttt ate acc ctg 288 
lie Ala Arg His Cys Val Glu Pro Thr Asn Gly Leu Phe He Thr Leu 
85 90 95 

ccg gaa gat gac etc tec ttc cag gee act ttt att cgc gee tgc gaa 336 
Pro Glu Asp Asp Leu Ser Phe Gin Ala Thr Phe He Arg Ala Cys Glu 
100 105 110 

gaa gca ggg ate age gca gaa get ata gac ccg cag caa gcg cgc att 384 
Glu Ala Gly He Ser Ala Glu Ala He Asp Pro Gin Gin Ala Arg He 
115 120 125 

ate gaa cct gee gtt aac ccg gca ctg att ggc gcg gtg aaa gtt ccg 432 
He Glu Pro Ala Val Asn Pro Ala Leu He Gly Ala Val Lys Val Pro 
130 135 140 

gat ggc acc gtt gat cca ttt cgt ctg acc gca gca aac atg ctg gat 480 
Asp Gly Thr Val Asp Pro Phe Arg Leu Thr Ala Ala Asn Met Leu Asp 
145 150 155 160 

gee aaa gaa cac ggt gee gtt ate ctt acc get cat gaa gtc acg ggg 528 
Ala Lys Glu His Gly Ala Val He Leu Thr Ala His Glu Val Thr Gly 
165 170 175 

ctg att cgt gaa ggc gcg acg gtg tgc ggt gtt cgt gta cgt aac cat 576 
Leu He Arg Glu Gly Ala Thr Val Cys Gly Val Arg Val Arg Asn His 
180 185 190 

etc acc ggc gaa act cag gee ctt cat gca cct gtc gtg gtt aat gee 624 
Leu Thr Gly Glu Thr Gin Ala Leu His Ala Pro Val Val Val Asn Ala 
195 200 205 

get ggg ate tgg ggg caa cac att gee gaa tat gee gat ctg cgc att 672 
Ala Gly He Trp Gly Gin His He Ala Glu Tyr Ala Asp Leu Arg He 
210 215 220 
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cgc atg ttc ccg gcg aaa gga teg ctg ctg ate atg gat cac cgc att 720 
Arg Met Phe Pro Ala Lys Gly Ser Leu Leu He Met Asp His Arg He 
225 230 235 240 

aac cag cat gtg ate aac cgc tgc cgt aaa cct tec gac gec gat att 768 
Asn Gin His Val He Asn Arg Cys Arg Lys Pro Ser Asp Ala Asp lie 
245 250 255 

ctg gtg cct ggc gat acc att teg ctg att ggt acc acc tct tta cgt 816 
Leu Val Pro Gly Asp Thr He Ser Leu He Gly Thr Thr Ser Leu Arg 
260 265 270 

att gat tac aac gag att gac gat aat cga gtg acg gca gaa gag gtt 864 
He Asp Tyr Asa Glu He Asp Asp Asn Arg Val Thr Ala Glu Glu Val 
275 280 285 

gat att ctg ctg cgt gaa ggg gaa aaa ctg gec ccc gtg atg gcg aaa 912 
Asp He Leu Leu Arg Glu Gly Glu Lys Leu Ala Pro Val Met Ala Lys 
290 295 300 

acg cgc att ttg egg gec tat tct ggc gtg cgc ccg ctg gtt gec age 960 
Thr Arg He Leu Arg Ala Tyr Ser Gly Val Arg Pro Leu Val Ala Ser 
305 310 315 320 . . 

gat gac gac ccg age gga cgt aac gtc age cgt ggc ate gtg ctg etc 1008 
Asp Asp Asp Pro Ser Gly Arg Asn Val Ser Arg Gly He Val Leu Leu 
325 330 335 

gac cat get gaa cgc gat ggt ctg gac gga ttt ate acc ate acc ggt 1056 
Asp His Ala Glu Arg Asp Gly Leu Asp Gly Phe He Thr He Thr Gly 
340 345 350 

ggc aaa ctg atg acc tat egg ctg atg get gaa tgg get acc gac gcg 1104 
Gly Lys Leu Met Thr Tyr Arg Leu Met Ala Glu Trp Ala Thr Asp Ala 
355 ~ 360 ~ 365 

gta tgc cgc aaa ctg ggc aac acg cgc ccc tgt acg act gee gat ctg 1152 
Val Cys Arg Lys Leu Gly Asn Thr Arg Pro Cys Thr Thr Ala Asp Leu 
370 375 380 

gca ctg cct ggt tea caa gaa ccc get gaa gtt acc ttg cgt aaa gtc 1200 
Ala Leu Pro Gly Ser Gin Glu Pro Ala Glu Val Thr Leu Arg Lys Val 
385 390 395 400 

ate tec ctg cct gec ccg ctg cgc ggt tct gcg gtt tat cgt cat ggc 1248 
He Ser Leu Pro Ala Pro Leu Arg Gly Ser Ala Val Tyr Arg His Gly 
405 410 415 

gat cgc acg cct gec tgg ctg age gaa ggc cgt ctg cac cgt age ctg 1296 
Asp Arg Thr Pro Ala Trp Leu Ser Glu Gly Arg Leu His Arg Ser Leu 
420 425 430 

gta tgt gag tgc gaa gcg gta act gcg ggt gaa gtg cag tac gcg gta 1344 
Val Cys Glu Cys Glu Ala Val Thr Ala Gly Glu Val Gin Tyr Ala Val 
435 440 445 

gaa aat tta aac gtt aat age ctg ctg gat tta cgc cgt cgt acc cgt 1392 
Glu Asn Leu Asn Val Asn Ser Leu Leu Asp Leu Arg Arg Arg Thr Arg 
450 455 460 
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gtg at 9 S9 C stcc tgc cag ggc gaa etc tgc gec tgc cgc get gee 1440 
Val Gly Met Gly Thr Cys Gin Gly Glu Leu Cys Ala Cys Arg Ala Ala 
465 470 475 480 

gga ctg ctg caa cgt ttt aac gtc acg acg tec gcg caa tct ate gag 1488 
Gly Leu Leu Gin Arg Phe Asn Val Thr Thr Ser Ala Gin Ser lie Glu 
485 490 495 

caa ctt tec ace ttc ctt aac gaa cgc tgg aaa ggc gtg caa ccc ate 1536 
Gin Leu Ser Thr Phe Leu Asn Glu Arg Trp Lys Gly Val Gin Pro lie 
500 505 510 

gee tgg gga gat gca ctg cgc gaa age gaa ttt acc cgc tgg gtt tat 1584 
Ala Trp Gly Asp Ala Leu Arg Glu Ser Glu Phe Thr Arg Trp Val Tyr 
515 520 525 

cag gga ttg tgt ggt ctg gag aag gag cag aaa gat gcg ctt tga 1629 
Gin Gly Leu Cys Gly Leu Glu Lys Glu Gin Lys Asp Ala Leu * 
530 535 540 



<210> 222 
<211> 1260 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(1260) 

<400> 222 

atg cgc ttt gat act gtc att atg ggc ggc ggc etc gee gga tta etc 48 

Met Arg Phe Asp Thr Val lie Met Gly Gly Gly Leu Ala Gly Leu Leu 
15 10 15 

tgt ggc ctg caa ctg caa aaa cac ggc ctg cgc tgt gee att gtc act 96 
Cys Gly Leu Gin Leu Gin Lys His Gly Leu Arg Cys Ala lie Val Thr 
20 25 30 

cgt ggt caa age gca ctg cat ttc tea tec gga teg ctg gat ttg ctg 144 
Arg Gly Gin Ser Ala Leu His Phe Ser Ser Gly Ser Leu Asp Leu Leu 
35 40 45 

age cat ctg cca gat ggt caa ccg gtg aca gac att cac agt gga ctg 192 
Ser His Leu Pro Asp Gly Gin Pro Val Thr Asp lie His Ser Gly Leu 
50 55 60 

gaa tct ttg cgt cag cag gca cca gee cat cct tac tec ctt etc gag 240 
Glu Ser Leu Arg Gin Gin Ala Pro Ala His Pro Tyr Ser Leu Leu Glu 
65 70 75 80 

cca caa cgc gtg etc gat etc get tgc cag gcg cag gca tta ate get 288 
Pro Gin Arg Val Leu Asp Leu Ala Cys Gin Ala Gin Ala Leu lie Ala 
85 90 95 

gaa age ggt gcg caa ttg cag ggc age gta gaa ctt get cac cag egg 336 
Glu Ser Gly Ala Gin Leu Gin Gly Ser Val Glu Leu Ala His Gin Arg 
100 105 no 

gtt acg ccg etc ggc act ctg cgc tct acc tgg eta agt teg cca gaa 384 
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Val Thr Pro Leu Gly Thr Leu Arg Ser Thr Trp Leu Ser Ser Pro Glu 
115 120 125 

gtc ccc gtc tgg ccg ctg ccc gcg aag aaa ata tgt gta gtg gga att 432 
Val Pro Val Trp Pro Leu Pro Ala Lys Lys lie Cys Val Val Gly lie 
130 135 140 

age ggc ctg atg gat ttt cag gcg cac ctt gcg gca get teg ttg cgt 480 
Ser Gly Leu Met Asp Phe Gin Ala His Leu Ala Ala Ala Ser Leu Arg 
145 150 155 160 

gaa etc ggc ctt gee gtt gaa ace gca gaa ata gag ctg ccg gaa ctg 528 
Glu Leu Gly Leu Ala Val Glu Thr Ala Glu lie Glu Leu Pro Glu Leu 
165 170 175 

gat gtg ctg cgc aat aac gee acc gaa ttt cgc gcg gtg aat ate gee 576 
Asp Val Leu Arg Asn Asn Ala Thr Glu Phe Arg Ala Val Asn lie Ala 
180 185 190 

cgt ttc ctt gat aat gaa gaa aac tgg ccg ctg tta ctt gat gcg ctt 624 
Arg Phe Leu Asp Asn Glu Glu Asn Trp Pro Leu Leu Leu Asp Ala Leu 
195 200 205 

att cct gtc gee aat acc tgc gaa atg ate ctg atg ccc gee tgc ttc 672 
lie Pro Val Ala Asn Thr Cys Glu Met lie Leu Met Pro Ala Cys Phe 
210 215 220 

ggt ctg gee gat gac aaa ctg tgg cgt tgg ttg aat gaa aaa eta cct 720 
Gly Leu Ala Asp Asp Lys Leu Trp Arg Trp Leu Asn Glu Lys Leu Pro 
225 230 235 240 

I 

tgt tea ctg atg ctt ttg cca acg ctg ccg cct tec gtg ctg ggc att 768 
Cys Ser Leu Met Leu Leu Pro Thr Leu Pro Pro Ser Val Leu Gly He 
245 250 255 

cgt ctg caa aac cag tta cag cgc cag ttt gtg cgc cag ggt ggc gtg 816 
Arg Leu Gin Asn Gin Leu Gin Arg Gin Phe Val Arg Gin Gly Gly Val 
260 265 270 

tgg atg ccg ggc gat gaa gtg aaa aaa gtg acc tgt aaa aat ggc gta 864 
Trp Met Pro Gly Asp Glu Val Lys Lys Val Thr Cys Lys Asn Gly Val 
275 280 285 

gtg aac gaa ate tgg acc cgc aat cac gee gat att ccg eta cgt cca 912 
Val Asn Glu He Trp Thr Arg Asn His Ala Asp He Pro Leu Arg Pro 
290 295 300 

cgt ttc gcg gtt etc gec age ggc agt ttc ttt agt ggc gga ctg gta 960 
Arg Phe Ala Val Leu Ala Ser Gly Ser Phe Phe Ser Gly Gly Leu Val 
305 310 315 320 

gcg gaa cgt aac ggc att cga gag ccg att etc ggc ctt gat gtg eta 1008 
Ala Glu Arg Asn Gly He Arg Glu Pro He Leu Gly Leu Asp Val Leu 
325 330 335 

caa acc gee acg egg ggt gaa tgg tat aag gga gat ttt ttt gcg ccg 1056 
Gin Thr Ala Thr Arg Gly Glu Trp Tyr Lys Gly Asp Phe Phe Ala Pro 
340 345 350 

caa ccg tgg cag cag ttc ggt gta acc act gat gag acg eta cgc ccg 1104 
Gin Pro Trp Gin Gin Phe Gly Val Thr Thr Asp Glu Thr Leu Arg Pro 
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355 360 365 

tea cag gca ggg caa acc att gaa aac ctg ttt gec ate ggt teg gtg 1152 
Ser Gin Ala Gly Gin Thr lie Glu Asn Leu Phe Ala lie Gly Ser Val 
370 375 380 

ctg ggc gga ttt gat ccc ate gee cag gga tgc ggc ggc ggt gtt tgt 1200 
Leu Gly Gly Phe Asp Pro He Ala Gin Gly Cys Gly Gly Gly Val Cys 
385 390 395 400 

gee gtc agt get tta cat gee get caa cag att gee caa cgc gca gga 1248 
Ala Val Ser Ala Leu His Ala Ala Gin Gin He Ala Gin Arg Ala Gly 
405 410 415 

ggc caa caa tga 1260 
Gly Gin Gin * 



<210> 223 
<211> 1191 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1191) 

<400> 223 

atg aat gac acc age ttc gaa aac tgc att aag tgc acc gtc tgc acc 48 
Met Asn Asp Thr Ser Phe Glu Asn Cys He Lys Cys Thr Val Cys Thr 
15 10 15 

acc gee tgc ccg gtg age egg gtg aat ccc ggt tat cca ggg cca aaa 96 
Thr Ala Cys Pro Val Ser Arg Val Asn Pro Gly Tyr Pro Gly Pro Lys 
20 25 30 

caa gee ggg ccg gat ggc gag cgt ctg cgt ttg aaa gat ggc gca ctg 144 
Gin Ala Gly Pro Asp Gly Glu Arg Leu Arg Leu Lys Asp Gly Ala Leu 
35 40 45 

tat gac gag gcg ctg aaa tat tgc ate aac tgc aaa cgt tgt gaa gtc 192 
Tyr Asp Glu Ala Leu Lys Tyr Cys He Asn Cys Lys Arg Cys Glu Val 
50 55 60 

gee tgc ccg tec gat gtg aag att ggc gat att ate cag cgc gcg egg 240 
Ala Cys Pro Ser Asp Val Lys He Gly Asp He He Gin Arg Ala Arg 
65 70 75 80 

gcg aaa tat gac acc acg cgc ccg teg ctg cgt aat ttt gtg ttg agt 288 
Ala Lys Tyr Asp Thr Thr Arg Pro Ser Leu Arg Asn Phe Val Leu Ser 
85 90 95 

cat acc gac ctg atg ggt age gtt tec acg ccg ttc gca cca ate gtc 336 
His Thr Asp Leu Met Gly Ser Val Ser Thr Pro Phe Ala Pro He Val 
100 105 110 

aac acc get acc teg ctg aaa ccg gtg egg cag ctg ctt gat gcg gcg 384 
Asn Thr Ala Thr Ser Leu Lys Pro Val Arg Gin Leu Leu Asp Ala Ala 
115 120 125 
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tta aaa ate gat cat cgc cgc acg eta ccg aaa tac tec ttc ggc acg 432 
Leu Lys He Asp His Arg Arg Thr Leu Pro Lys Tyr Ser Phe Gly Thr 
130 % 135 140 

ttc cgt cgc tgg tat cgc age gtg gcg get cag caa gca caa tat aaa 480 
Phe Arg Arg Trp Tyr Arg Ser Val Ala Ala Gin Gin Ala Gin Tyr Lys 
145 150 155 160 

gac cag gtc get ttc ttt cac ggc tgc ttc gtt aac tac aac cat ccg 528 • 
Asp Gin Val Ala Phe Phe His Gly Cys Phe Val Asn Tyr Asn His Pro 
165 170 175 

cag tta ggt aaa gat tta att aaa gtg etc aac gca atg ggt acc ggt 576 
Gin Leu Gly Lys Asp Leu He Lys Val Leu Asn Ala Met Gly Thr Gly 
180 185 190 

gta caa ctg etc age aaa gaa aaa tgc tgc ggc gta ccg eta ate gee 624 
Val Gin Leu Leu Ser Lys Glu Lys Cys Cys Gly Val Pro Leu He Ala 
195 200 205 

aac ggc ttt acc gat aaa gca cgc aaa cag gca att acg aat gta gag 672 
Asn Gly Phe Thr Asp Lys Ala Arg Lys Gin Ala He Thr Asn Val Glu 
210 215 220 

teg ate cgc gaa get gtg gga gta aaa ggc att ccg gtg att gee acc 720 
Ser He Arg Glu Ala Val Gly Val Lys Gly He Pro Val He Ala Thr 
225 230 235 240 

tec tea acc tgt aca ttt gee ctg cgc gac gaa tac ccg gaa gtg ctg 768 
Ser Ser Thr Cys Thr Phe Ala Leu Arg Asp Glu Tyr Pro Glu Val Leu 
245 250 255 

aat gtc gac aac aaa ggc ttg cgc gat cat ate gaa ctg gca acc cgc, 816 
Asn Val Asp Asn Lys Gly Leu Arg Asp His He Glu Leu Ala Thr Arg 
260 265 270 

tgg ctg tgg cgc aag ctg gac gaa ggc aaa acg tta ccg ctg aaa ccg 864 
Trp Leu Trp Arg Lys Leu Asp Glu Gly Lys Thr Leu Pro Leu Lys Pro 
275 280 285 

ctg ccg ctg aaa gtg gtt tat cac act ccg tgc cat atg gaa aaa atg 912 
Leu Pro Leu Lys Val Val Tyr His Tor Pro Cys His Met Glu Lys Met 
290 295 300 

ggc tgg acg etc tac acc ctg gag ctg ttg cgt aac ate ccg ggg ctt 960 
Gly Trp Thr Leu Tyr Thr Leu Glu Leu Leu Arg Asn He Pro Gly Leu 
305 310 315 320 

gag tta acg gtg ctg gat tec cag tgc tgc ggt att gcg ggt act tac 1008 
Glu Leu Thr Val Leu Asp Ser Gin Cys Cys Gly He Ala Gly Thr Tyr 
325 330 335 

ggt ttc aaa aaa gag aac tac ccc acc tea caa gec ate ggc gca cca 1056 
Gly Phe Lys Lys Glu Asn Tyr Pro Thr Ser Gin Ala He Gly Ala Pro 
340 345 350 

ctg ttc cgc cag at a gaa gaa age ggc gca gat ctg gtg gtc acc gac 1104 
Leu Phe Arg Gin He Glu Glu Ser Gly Ala Asp Leu Val Val Thr Asp 
355 360 365 

tgc gaa acc tgt aaa tgg cag att gag atg tec aca agt ctt cgc tgc 1152 
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Cys Glu Thr Cys Lys Trp Gin lie Glu Met Ser Thr Ser Leu Arg Cys 
370 375 380 

gaa cat ccg att acg eta ctg gec cag gcg ctg get taa 1191 
Glu His Pro He Thr Leu Leu Ala Gin Ala Leu Ala * 
385 390 395 



<210> 224 
<211> 909 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (909) 

<400> 224 

atg caa tta cgt aaa cct gec aca gca ate etc gec ctg gcg ctt tec 48 
Met Gin Leu Arg Lys Pro Ala Thr Ala He Leu Ala Leu Ala Leu Ser 
15 10 15 

gca gga ctg gca cag gca gat gac gec gec ccg gca gcg ggc agt act 96 
Ala Gly Leu Ala Gin Ala Asp Asp Ala Ala Pro Ala Ala Gly Ser Thr 
20 25 30 

ctg gac aaa ate gec aaa aac ggt gtg att gtc gtc ggt cac cgt gaa 144 
Leu Asp Lys He Ala Lys Asn Gly Val He Val Val Gly His Arg Glu 
35 40 45 

tct tea gtg cct ttc tct tat tac gac aat cag caa aaa gtg gtg ggt 192 
Ser Ser Val Pro Phe Ser Tyr Tyr Asp Asn Gin Gin Lys Val Val Gly 
50 55 60 

tac teg cag gat tac tec aac gec att gtt gaa gca gtg aaa aag aaa 240 
Tyr Ser Gin Asp Tyr Ser Asn Ala He Val Glu Ala Val Lys Lys Lys 
<65 70 75 80 

etc aac aaa ccg gac ttg cag gta aaa ctg att ccg att acc tea caa 288 
Leu Asn Lys Pro Asp Leu Gin Val Lys Leu He Pro He Thr Ser Gin 
85 90 95 

aac cgt att cca ctg ctg caa aac ggc act ttc gat ttt gaa tgt ggt 336 
Asn Arg He Pro Leu Leu Gin Asn Gly Thr Phe Asp Phe Glu Cys Gly 
100 105 110 

tct acc acc aac aac gtc gaa cgc caa aaa cag gcg get ttc tct gac 3B4 
Ser Thr Thr Asn Asn Val Glu Arg Gin Lys Gin Ala Ala Phe Ser Asp 
115 120 125 

act att ttc gtg gtc ggt acg cgc ctg ttg acc aaa aag ggt ggc gat 432 
Thr He Phe Val Val Gly Thr Arg Leu Leu Thr Lys Lys Gly Gly Asp 
130 135 140 

ate aaa gat ttt gec aac ctg aaa gac aaa gec gta gtc gtc act tec 480 
He Lys Asp Phe Ala Asn Leu Lys Asp Lys Ala Val Val Val Thr Ser 
145 150 155 160 

ggc act acc tct gaa gtt ttg etc aac aaa ctg aat gaa gag caa aaa 528 
Gly Thr Thr Ser Glu Val Leu Leu Asn Lys Leu Asn Glu Glu Gin Lys 
165 170 175 
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atg aat atg cgc ate ate age gee aaa gat cac ggt gac tct ttc cgc 576 
Met Asn Met Arg He He Ser Ala Lys Asp His Gly Asp Ser Phe Arg 
180 185 190 

acc ctg gaa age ggt cgt gee gtt gee ttt atg atg gat gac get ctg 624 
Thr Leu Glu Ser Gly Arg Ala Val Ala Phe Met Met Asp Asp Ala Leu 
195 200 205 

ctg gec ggt gaa cgt gcg aaa gcg aag aaa cca gac aac tgg gaa ate 672 
Leu Ala Gly Glu Arg Ala Lys Ala Lys Lys Pro Asp Asn Trp Glu He 
210 215 220 

gtc ggc aag ccg cag tct cag gag gec tac ggt tgt atg ttg cgt aaa 720 
Val Gly Lys Pro Gin Ser Gin Glu Ala Tyr Gly Cys Met Leu Arg Lys 
225 230 235 240 

gat gat ccg cag ttc aaa aag ctg atg gat gac acc ate get cag gtg 768 
Asp Asp Pro Gin Phe Lys Lys Leu Met Asp Asp Thr lie Ala Gin Val 
245 250 255 

cag acc tec ggt gaa gcg gaa aaa tgg ttt gat aag tgg ttc aaa aat 816 
Gin Thr Ser Gly Glu Ala Glu Lys Trp Phe Asp Lys Trp Phe Lys Asn 
260 265 270 

cca att ccg ccg aaa aac ctg aac atg aat ttc gaa ctg tea gac gaa 864 
Pro He Pro Pro Lys Asn Leu Asn Met Asn Phe Glu lieu Ser Asp Glu 
275 280 285 

atg aaa gca ctg ttc aaa gaa ccg aat gac aag gca ctg aac taa 909 
Met Lys Ala Leu Phe Lys Glu Pro Asn Asp Lys Ala Leu Asn * 
290 295 300 



<210> 225 
<211> 987 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (987) 

<400> 225 

atg aat aac age get ttt act ttc cag aca eta cac ccg gat acc ate 48 
Met Asn Asn Ser Ala Phe Thr Phe Gin Thr Leu His Pro Asp Thr He 
1 5 10 15 

atg gac get ctg ttt gag cat ggg ate egg gtg gat tec ggt ctt acc 96 
Met Asp Ala Leu Phe Glu His Gly He Arg Val Asp Ser Gly Leu Thr 
20 25 30 

ccg ctt aac age tat gaa aac cgt gtc tat caa ttt cag gac gaa gat 144 
Pro Leu Asn Ser Tyr Glu Asn Arg Val Tyr Gin Phe Gin Asp Glu Asp 
35 40 45 

cgt cga cgt ttt gtc gtc aaa ttt tat cgc cct gaa cgt tgg aca gee 192 
Arg Arg Arg Phe Val Val Lys Phe Tyr Arg Pro Glu Arg Trp Thr Ala 
50 55 60 
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gat caa ate etc gaa gaa cat caa ttt gcg ttg cag ctg gta aat gat 240 
Asp Gin lie Leu Glu Glu His Gin Phe Ala Leu Gin Leu Val Asn Asp 
65 70 75 80 

gaa gtt ccg gtc gca gca cct gtg gec ttt aac ggt cag act tta ttg 288 
Glu Val Pro Val Ala Ala Pro Val Ala Phe Asn Gly Gin Thr Leu Leu 
85 90 95 

aat cat cag gga ttt tat ttc get gtt ttt cca age gtc ggt ggt cgc 336 
Asn His Gin Gly Phe Tyr Phe Ala Val Phe Pro Ser Val Gly Gly Arg 
100 105 110 

cag ttc gaa get gat aat ate gat cag atg gaa gcg gtt ggg cgt tat 384 
Gin Phe Glu Ala Asp Asn He Asp Gin Met Glu Ala Val Gly Arg Tyr 
115 120 125 

tta ggg cgt atg cac cag acg ggg cgc aaa cag ctt ttt ate cat cgc 432 
Leu Gly Arg Met His Gin Thr Gly Arg Lys Gin Leu Phe He His Arg 
130 135 140 

ccg ace ate ggt ttg aac gaa tat etc att gag cca cgc aag ctg ttt 480 
Pro Thr He Gly Leu Asn Glu Tyr Leu He Glu Pro Arg Lys Leu Phe 
145 150 155 160 

gag gac get aca ctg ata cct tec ggg ttg aaa gcg gca ttc ctg aaa 528 
Glu Asp Ala Thr Leu He Pro Ser Gly Leu Lys Ala Ala Phe Leu Lys 
165 170 175 

gcg aca gat gag ctg att gee gee gtt aca gca cac tgg egg gaa gat 576 
Ala Thr Asp Glu Leu He Ala Ala Val Thr Ala His Trp Arg Glu Asp 
180 185 190 

ttc ace gtt ctg egg eta cat gga gac tgc cac gee ggg aat att etc 624 
Phe Thr Val Leu Arg Leu His Gly Asp Cys His Ala Gly Asn He Leu 
195 200 205 

tgg cgc gat ggt cca atg ttt gtt gat ctg gat gat gca cgt aat ggt 672 
Trp Arg Asp Gly Pro Met Phe Val Asp Leu Asp Asp Ala Arg Asn Gly 
210 215 220 

cca gee gtt cag gat ttg tgg atg ttg etc aat ggc gat aaa gee gag 720 
Pro Ala Val Gin Asp Leu Trp Met Leu Leu Asn Gly Asp Lys Ala Glu 
225 230 235 240 

cag egg atg caa ctg gaa act att att gaa get tat gaa gaa ttt age 768 
Gin Arg Met Gin Leu Glu Thr He He Glu Ala Tyr Glu Glu Phe Ser 
245 250 255 

gag ttc gac ace get gaa ate gga ctg att gaa cct tta cgc gee atg 816 
Glu Phe Asp Thr Ala Glu He Gly Leu He Glu Pro Leu Arg Ala Met 
260 265 270 

cgt ttg gtt tat tat ctt gee tgg eta atg egg cgt tgg get gat ccc 864 
Arg Leu Val Tyr Tyr Leu Ala Trp Leu Met Arg Arg Trp Ala Asp Pro 
275 280 285 

gcg ttc ccg aaa aat ttc ccg tgg tta acc ggg gaa gat tac tgg ctg 912 
Ala Phe Pro Lys Asn Phe Pro Trp Leu Thr Gly Glu Asp Tyr Trp Leu 
290 295 300 

cga cag acg gcg act ttt ata gaa cag gca aaa gtt eta caa gaa ccc 960 
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Arg Gin Thr Ala Thr Phe He Glu Gin Ala Lys Val Leu Gin Glu Pro 
305 310 315 320 

cct ttg caa tta aca cct atg tat taa 987 
Pro Leu Gin Leu Thr Pro Met Tyr * 
325 



<210> 226 
<211> 627 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (627) 

<400> 226 

atg aaa aag att tgg ctg gcg ctg get ggt tta gtt tta gcg ttt age 48 

Met Lys Lys He Trp Leu Ala Leu Ala Gly Leu Val Leu Ala Phe Ser 
1^5 10 15 

gca teg gcg gcg cag tat gaa gat ggt aaa cag tac act acc ctg gaa 96 
Ala Ser Ala Ala Gin Tyr Glu Asp Gly Lys Gin Tyr Thr Thr Leu Glu 
20 25 30 

aaa ccg gta get ggc gcg ccg caa gtg ctg gag ttt ttc tct ttc ttc 144 
Lys Pro Val Ala Gly Ala Pro Gin Val Leu Glu Phe Phe Ser Phe Phe 
35 40 45 

tgc ccg cac tgc tat cag ttt gaa gaa gtt ctg cat att tct gat aat 192 
Cys Pro His Cys Tyr Gin Phe Glu Glu Val Leu His He Ser Asp Asn 
50 " 55 60 

gtg aag aaa aaa ctg ccg gaa ggc gtg aag atg act aaa tac cac gtc 240 
Val Lys Lys Lys Leu Pro Glu Gly Val Lys Met Thr Lys Tyr His Val 
65 70 75 80 

aac ttc atg ggt ggt gac ctg ggc aaa gat ctg act cag gca tgg get 288 
Asn Phe Met Gly Gly Asp Leu Gly Lys Asp Leu Thr Gin Ala Trp Ala 
85 90 95 

gtg gcg atg gcg ctg ggc gtg gaa gac aaa gtg act gtt ccg ctg ttt 336 
Val Ala Met Ala Leu Gly Val Glu Asp Lys Val Thr Val Pro Leu Phe 
100 105 110 

gaa ggc gta cag aaa acc cag acc att cgt tct get tct gat ate cgc 384 
Glu Gly Val Gin Lys Thr Gin Thr ile Arg Ser Ala Ser Asp He Arg 
115 120 125 

gat gta ttt ate aac gca ggt att aaa ggt gaa gag tac gac gcg gcg 432 
Asp Val Phe He Asn Ala Gly He Lys Gly Glu Glu Tyr Asp Ala Ala 
130 135 ,140 

tgg aac age ttc gtg gtg aaa tct ctg gtc get cag cag gaa aaa get 480 
Trp Asn Ser Phe Val Val Lys Ser Leu Val Ala Gin Gin Glu Lys Ala 
145 150 155 160 

gca get gac gtg caa ttg cgt ggc gtt ccg gcg atg ttt gtt aac ggt 528 
Ala Ala Asp Val Gin Leu Arg Gly Val Pro Ala Met Phe Val Asn Gly 
165 170 175 
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aaa tat cag ctg aat ccg cag ggt atg gat acc age aat atg gat gtt 576 
Lys Tyr Gin Leu Asn Pro Gin Gly Met Asp Thr Ser Asn Met Asp Val 
180 185 190 

ttt gtt cag cag tat get gat aca gtg aaa tat ctg tec gag aaa aaa 624 
Phe Val Gin Gin Tyr Ala Asp Thr Val Lys Tyr Leu Ser Glu Lys Lys 
195 200 205 

taa 627 
* 



<210> 227 
<211> 339 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (339) 

<400> 227 

atg acg atg aat tea ttc gag cga agg aat aag ate ate caa tta gtg 48 
Met Thr Met Asn Ser Phe Glu Arg Arg Asn Lys lie lie Gin Leu Val 
15 10 15 

aat gaa cag gga acc gtg ctt gtt cag gat ctg gcg gga gta ttt get 96 
Asn Glu Gin Gly Thr Val Leu Val Gin Asp Leu Ala Gly Val Phe Ala 
20 25 30 

gee teg gaa gcg aca ate cgt gec gat ttg cgc ttt etc gaa caa aaa 144 
Ala Ser Glu Ala Thr lie Arg Ala Asp Leu Arg Phe Leu Glu Gin Lys 
35 40 45 

ggc gtg gtt acg cgc ttt cat ggc ggt gcg gcg aaa ata atg tct ggt 192 
Gly Val Val Thr Arg Phe His Gly Gly Ala Ala Lys lie Met Ser Gly 
50 55 60 

aat agt gaa acc gag acc cag gaa gtc ggg ttt aaa gag cga ttt cag 240 
Asn Ser Glu Thr Glu Thr Gin Glu Val Gly Phe Lys Glu Arg Phe Gin 
65 70 75 80 

etc gee age gcg cca aaa aac aga ata gcg cag gcg gca gtc aaa atg 288 
Leu Ala Ser Ala Pro Lys Asn Arg lie Ala Gin Ala Ala Val Lys Met 
85 90 95 

ate cac gaa ggg atg act gat cct acc cac gta ata tgg aca cag gee 336 
lie His Glu Gly Met Thr Asp Pro Thr His Val lie Trp Thr Gin Ala 
100 105 110 

taa 339 
* 



<210> 228 
<211> 1041 
<212> DMA 

<213> Escherichia coli 
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<220> 

<221> CDS 

<222> (1) . . . (1041) 

<400> 228 

atg aaa tea gtg gtg aat gat act gat ggt ate gtg cgc gtt gca gaa 48 
Met Lys Ser Val Val Asn Asp Thr Asp Gly lie Val Arg Val Ala Glu 
15 10 15 

age gtc att cct gaa att aaa cat cag gat gag gtg egg gta aaa att 96 
Ser Val lie Pro Glu He Lys His Gin Asp Glu Val Arg Val Lys He 
20 25 30 

gee age teg ggc tta tgt ggt tec gat tta ccc agg ata ttt aaa aat 144 
Ala Ser Ser Gly Leu Cys Gly Ser Asp Leu Pro Arg He Phe Lys Asn 
35 40 45 

ggt gca cat tat tat cca ata acg tta ggc cat gaa ttt age ggc tat 192 
Gly Ala His Tyr Tyr Pro He Thr Leu Gly His Glu Phe Ser Gly Tyr 
50 55 60 

att gat gcg gtg gga tec ggt gtt gat gat tta cac cct ggc gat gcg 240 
He Asp Ala Val Gly Ser Gly Val Asp Asp Leu His Pro Gly Asp Ala 
65 70 75 80 

gtt gee tgt gtg ccg tta tta ccc tgt ttt act tgt cca gag tgt ctg 288 
Val Ala Cys Val Pro Leu Leu Pro Cys Phe Thr Cys Pro Glu Cys Leu 
85 90 95 

aaa ggg ttt tat tec cag tgc gca aaa tat gat ttt att ggc teg egg 336 
Lys Gly Phe Tyr Ser Gin Cys Ala Lys Tyr Asp Phe He Gly Ser Arg 
100 105 ■ 110 

cgt gat ggt gga ttt get gaa tat att gtc gtt aag cga aaa aat gtc 384 
Arg Asp Gly Gly Phe Ala Glu Tyr He Val Val Lys Arg Lys Asn Val 
115 120 125 

ttt get eta ccc acg gat atg cct att gag gat ggg get ttt att gag 432 
Phe Ala Leu Pro Thr Asp Met Pro He Glu Asp Gly Ala Phe He Glu 
130 135 140 

ccg att ace gtt ggt ctg cat get ttt cat tta gcg caa ggt tgt gag 480 
Pro He Thr Val Gly Leu His Ala Phe His Leu Ala Gin Gly Cys Glu 
145 150 155 160 

aat aaa aac gtt att att att ggt gee gga acc att ggc ctg ctg gee 528 
Asn Lys Asn Val He He He Gly Ala Gly Thr He Gly Leu Leu Ala 
165 170 175 

att cag tgc get gtc gcg ctg gga gca aag agt gtg acg gcg ate gac 576 
He Gin Cys Ala Val Ala Leu Gly Ala Lys Ser Val Thr Ala He Asp 
180 185 190 

att agt tea gaa aaa ctg gca ctg gca aaa tct ttc ggt gcg atg caa 624 
He Ser Ser Glu Lys Leu Ala Leu Ala Lys Ser Phe Gly Ala Met Gin 
195 200 205 

aca ttt aac agt age gaa atg age gcg ccg caa atg cag age gtt tta 672 
Thr Phe Asn Ser Ser Glu Met Ser Ala Pro Gin Met Gin Ser Val Leu 
210 215 220 
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cgc gaa ctg cgc ttt aat cag ctt ate etc gag acg get ggc gta ccg 720 
Arg Glu Leu Arg Phe Asn Gin Leu lie Leu Glu Thr Ala Gly Val Pro 

225 230 235 240 

) 

caa act gtc gaa ctg gcg gta gag att gec ggt cct cat gee caa ctg 768 
Gin Thr Val Glu Leu Ala Val Glu lie Ala Gly Pro His Ala Gin Leu 
245 250 255 

gcg ctg gtg ggc acg ttg cat cag gat ctg cat tta aca teg gca acg 816 
Ala Leu Val Gly Thr Leu His Gin Asp Leu His Leu Thr Ser Ala Thr 
260 265 270 

ttt ggc aaa ata ttg cgt aaa gag ctg acg gtt ate ggc agt tgg atg 864 
Phe Gly Lys lie Leu Arg Lys Glu Leu Thr Val lie Gly Ser Trp Met 
275 280 285 

aac tac tec age cct tgg ccg ggg cag gag tgg gaa acg gcg age egg 912 
Asn Tyr Ser Ser Pro Trp Pro Gly Gin Glu Trp Glu Thr Ala Ser Arg 
290 295 300 

ttg ctg aca gaa cgt aag tta age ctg gag cca tta ate get cac cgt 960 
Leu Leu Thr Glu Arg Lys Leu Ser Leu Glu Pro Leu lie Ala His Arg 
305 310 315 320 

gga age ttt gaa age ttc gee cag gcg gtg cgt gac ate get cgt aat 1008 
Gly Ser Phe Glu Ser Phe Ala Gin Ala Val Arg Asp lie Ala Arg Asn 
325 330 335 

get atg ccg ggc aaa gtg ttg etc att ccc tga 1041 
Ala Met Pro Gly Lys Val Leu Leu lie Pro * 
340 345 



<210> 229 
<211> 1356 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(1356) 

<400> 229 

atg ttt tea gaa gtc atg cgt tat att etc gac etc ggc cct acg gtg 48 
Met Phe Ser Glu Val Met Arg Tyr lie Leu Asp Leu Gly Pro Thr Val 
15 10 15 

atg ctg ccg att gtc ate att att ttt tct aaa ata tta ggc atg aag 96 
Met Leu Pro lie Val lie lie He Phe Ser Lys He Leu Gly Met Lys 
20 25 30 

gca ggc gat tgc ttt aaa gcg ggt ctg cat ate ggg att ggc ttt gtt 144 
Ala Gly Asp Cys Phe Lys Ala Gly Leu His He Gly He Gly Phe Val 
35 40 45 

ggc att ggc ctt gtg att ggc tta atg ctg gat tec att ggc ccg gcg 192 
Gly He Gly Leu Val He Gly Leu Met Leu Asp Ser He Gly Pro Ala 
50 55 60 

gcg aaa gcg atg gcg gaa aat ttc gac ctg aat ctg cat gtg gtc gat 240 
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Ala Lys Ala Met Ala Glu Asn Phe Asp Leu Asn Leu His Val Val Asp 
65 70 75 80 

gtt ggc tgg ccg ggc tct tea cca atg acc tgg gcg teg caa att gcg 288 
Val Gly Trp Pro Gly Ser Ser Pro Met Thr Trp Ala Ser Gin lie Ala 
85 90 95 

ctg gtg gcg att ccg att gcg att ctg gtt aac gtg gcg atg tta ctg 336 
Leu Val Ala He Pro He Ala He Leu Val Asn Val Ala Met Leu Leu 
100 105 110 

acc cgt atg acg egg gtg gta aat gtt gat ate tgg aat ate tgg cat 384 
Thr Arg Met Thr Arg Val Val Asn Val Asp He Trp Asn He Trp His 
115 120 125 

atg acc ttc acc ggc gcg ttg ctg cat ctg gca acc ggt tea tgg atg 432 
Met Thr Phe Thr Gly Ala Leu Leu His Leu Ala Thr Gly Ser Trp Met 
130 135 140 

ata ggg atg gca ggt gtg gta att cac gcg gcg ttt gtt tat aag etc 480 
He Gly Met Ala Gly Val Val He His Ala Ala Phe Val Tyr Lys Leu 
145 150 155 160 

ggc gac tgg ttt gee cgc gat acc cga aat ttc ttt gag ctg gaa ggt 528 
Gly Asp Trp Phe Ala Arg Asp Thr Arg Asn Phe Phe Glu Leu Glu Gly 
165 170 175 

att get att ccg cac ggt acg teg gcg tat atg ggg ccg att gcg gtg 576 
He Ala He Pro His Gly Thr Ser Ala Tyr Met Gly Pro He Ala Val 
180 185 190 

ctg gtc gat get ate ate gag aaa ate cca ggc gtt aac cga att aaa 624 
Leu Val Asp Ala He He Glu Lys He Pro Gly Val Asn Arg He Lys 
195 200 205 

ttt age gee gac gat att cag cgc aaa ttt ggt cca ttt ggc gag cct 672 
Phe Ser Ala Asp Asp He. Gin Arg Lys Phe Gly Pro Phe Gly Glu Pro 
210 215 220 

gtc acc gtg ggt ttt gtg atg ggg ctg att ate ggc ate etc gcg ggt 720 
Val Thr Val Gly Phe Val Met Gly Leu He He Gly He Leu Ala Gly 
225 230 235 240 

tac gat gtc aaa ggt gta ttg cag ctg gcg gta aaa acg gcg gca gtg 768 
Tyr Asp Val Lys Gly Val Leu Gin Leu Ala Val Lys Thr Ala Ala Val 
245 250 255 

atg ctg eta atg cca egg gtg att aaa ccc ate atg gat ggt tta acg 816 
Met Leu Leu Met Pro Arg Val He Lys Pro He Met Asp Gly Leu Thr 
260 265 270 

ccc ate get aag cag get cgt agt cgt tta cag gcg aag ttc ggc ggt 864 
Pro He Ala Lys Gin Ala Arg Ser Arg Leu Gin Ala Lys Phe Gly Gly 
275 280 285 

cag gag ttc ctg att ggc ctt gat ccg gcg ttg ctg ctg gga cat acg 912 
Gin Glu Phe Leu He Gly Leu Asp Pro Ala Leu Leu Leu Gly His Thr 
290 295 300 

gcg gtg gta teg gca age ctg att ttt ate cca etc acc att tta att 960 
Ala Val Val Ser Ala Ser Leu He Phe He Pro Leu Thr He Leu He 
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305 310 315 320 

get gtt tgt gtg ccg ggt aat cag gtg ctg ccg ttt ggc gat ctt gec 1008 
Ala Val Cys Val Pro Gly Asn Gin Val Leu Pro Phe Gly Asp Leu Ala 
325 330 335 

acc ate ggc ttc ttc gtg gcg atg gcg gtc gec gtg cat cgt gga aat 1056 
Thr He Gly Phe Phe Val Ala Met Ala Val Ala Val His Arg Gly Asn 
340 345 350 

ctg ttc cgc acc tta ate teg ggt gtc ate att atg age ate acc ctg 1104 
Leu Phe Arg Thr Leu He Ser Gly Val He He Met Ser He Thr Leu 
355 360 365 

tgg ate gcg acg caa act att ggt ttg cac acc caa ctg gcg get aat 1152 
Trp He Ala Thr Gin Thr He Gly Leu His Thr Gin Leu Ala Ala Asn 
370 375 380 

get ggg gcg tta aaa gee ggg ggt atg gtg get tea atg gat cag ggc 1200 
Ala Gly Ala Leu Lys Ala Gly Gly Met Val Ala Ser Met Asp Gin Gly 
385 390 395 400 

ggt tct ccc att acc tgg tta ctg att cag gtt ttc tec ccg caa aat 1248 
Gly Ser Pro He Thr Trp Leu Leu He Gin Val Phe Ser Pro Gin Asn 
405 410 415 

att ccc ggt ttc att att ate ggt gca att tat ctg acc ggt att ttc 1296 
He Pro Gly Phe He He He Gly Ala He Tyr Leu Thr Gly He Phe 
420 425 430 

atg acc tgg cgt aga gcg cgt ggc ttt att aaa caa gag aaa gtc gtt 1344 
Met Thr Trp Arg Arg Ala Arg Gly Phe He Lys Gin Glu Lys Val Val 
435 440 445 

etc gca gaa taa 1356 
Leu Ala Glu * 
450 



<210> 230 
<211> 285 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (285) 

<400> 230 

atg aaa cgc aag att att gtc get tgc gga ggc gcg gtt gcg acc tct 48 
Met Lys Arg Lys He He Val Ala Cys Gly Gly Ala Val Ala Thr Ser 
15 10 15 

acg atg gcg gcg gaa gaa att aaa gag ttg tgt cag aat cat aat att 96 
Thr Met Ala Ala Glu Glu He Lys Glu Leu Cys Gin Asn His Asn He 
20 25 30 

cct gtt gaa tta ate cag tgt egg gtt aat gaa ata gaa acc tat atg 144 
Pro Val Glu Leu He Gin Cys Arg Val Asn Glu He Glu Thr Tyr Met 
35 40 45 
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gat ggt gtg cat ttg ata tgc acc act gcc aaa gtg gat cgt agt ttt 192 
Asp Gly Val His Leu He Cys Thr Thr Ala Lys Val Asp Arg Ser Plie 
50 55 60 

ggc gat att ccg tta gtt cac ggc atg cct ttt att tct ggt ate ggt 240 
Gly Asp He Pro Leu Val His Gly Met Pro Phe He Ser Gly He Gly 
65 70 75 80 

ate gaa gca tta caa aat aaa att ctg act ate tta cag ggg tga 285 
He Glu Ala Leu Gin Asn Lys He Leu Thr He Leu Gin Gly * 
85 90 



<210> 231 
<211> 453 
<212> DMA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(453) 

<400> 231 

atg act aac ctg ttt gtt cgt age gga att tct ttt gtc gat cgt age 48 
Met Thr Asn Leu Phe Val Arg Ser Gly He Ser Phe Val Asp Arg Ser 
1 5 10 15 

gaa gtt tta acc cat ate ggt aat gag atg etc gcc aaa ggt gtg gtt 96 
Glu Val Leu Thr His He Gly Asn Glu Met Leu Ala Lys Gly Val Val 
20 25 30 

cat gat acc tgg cca cag gca tta att gcc aga gaa gca gaa ttc cct 144 
His Asp Thr Trp Pro Gin Ala Leu He Ala Arg Glu Ala Glu Phe Pro 
35 40 45 

acc ggg ata atg ctt gag cag cac get att gca ata ccg cat tgt gag 192 
Thr Gly He Met Leu Glu Gin His Ala He Ala He Pro His Cys Glu 
50 55 60 

gcg att cat get aag teg tea gcc att tat ctg tta agg cca aca aat 240 
Ala He His Ala Lys Ser Ser Ala He Tyr Leu Leu Arg Pro Thr Asn 
65 70 75 80 

aaa gtt cat ttt cag caa gcg gat gat gat aac gac gtg gcg gta teg 288 
Lys Val His Phe Gin Gin Ala Asp Asp Asp Asn Asp Val Ala Val Ser 
85 90 95 



ttg gtt att gcg ttg att gtg gaa aat ccg cag cag caa ttg aaa ctt 336 
Leu Val He Ala Leu He Val Glu Asn Pro Gin Gin Gin Leu Lys Leu 
100 105 110 

tta cgc tgt tta ttt ggc aag tta caa cag ccc gat ate gtc gag aca 384 
Leu Arg Cys Leu Phe Gly Lys Leu Gin Gin Pro Asp He Val Glu Thr 
115 120 125 



eta ate act ctt cct gaa acc cag tta aag gaa tac ttc aca aag tat 432 
Leu He Thr Leu Pro Glu Thr Gin Leu Lys Glu Tyr Phe Thr Lys Tyr 
130 135 140 

gtt tta gat tea gac gaa taa 453 
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Val Leu Asp Ser Asp Glu 
145 150 



<210> 232 
<211> 1263 
<212> DNA 

<213> Escherichia coli 



<220> 
<221> CDS 
<222> (1).. 



. (1263) 



<400> 232 

atg aaa acg tta att gcc egg cat aaa get ggt gaa cat ate ggc ata 
Met Lys Thr Leu He Ala Arg His Lys Ala Gly Glu His lie Gly He 
15 10 15 



48 



tgt tea gtc tgt tct gcc cat ccg ttg gtt ate gaa gcg gcg ctg gca 
Cys Ser Val Cys Ser Ala His Pro Leu Val He Glu Ala Ala Leu Ala 
20 25 30 



96 



ttt gat cgc aac age acg cgc aaa gtg ctg att gaa gca acg tea aac 
Phe Asp Arg Asn Ser Thr Arg Lys Val Leu He Glu Ala Thr Ser Asn 
35 40 45 



144 



cag gtc aat caa ttt ggc ggt tat ace gga atg aca ccg gca gac ttt 
Glu Val Asn Gin Phe Gly Gly Tyr Thr Gly Met Thr Pro Ala Asp Phe 
50 55 60 



192 



cgc gaa ttt gtt ttt acg att gcc gat aaa gtt ggg ttt gca cgc gaa 
Arg Glu Phe Val Phe Thr He Ala Asp Lys Val Gly Phe Ala Arg Glu 
65 70 75 80 



240 



cgc att - att etc ggc ggc gat cat ctg ggg cca aac tgc tgg cag caa 
Arg He He Leu Gly Gly Asp His Leu Gly Pro Asn Cys Trp Gin Gin 
85 90 95 



288 



gaa aat gcg gat gcg gcg atg gaa aaa tec gtc gag ctg gta aag gaa 
Glu Asn Ala Asp Ala Ala Met Glu Lys Ser Val Glu Leu Val Lys Glu 
100 105 110 



336 



tat gtt cgt gcc ggc ttc agt aaa att cat ctt gat gcg tea atg tec 
Tyr Val Arg Ala Gly Phe Ser Lys He His Leu Asp Ala Ser Met Ser 
115 120 125 



384 



tgc gcg ggg gat ccc ata ccg tta gca cca gaa acg gtt gcg gaa cga 
Cys Ala Gly Asp Pro He Pro Leu Ala Pro Glu Thr Val Ala Glu Arg 
130 135 140 



432 



get get gtg ctt tgc ttt get gcg gaa agt gtg gcg aca gat tgc cag 
Ala Ala Val Leu Cys Phe Ala Ala Glu Ser Val Ala Thr Asp Cys Gin 
145 150 155 160 



480 



cgt gag caa ctg age tat gtc att ggc acc gaa gtt ccg gtt ccg ggc 
Arg Glu Gin Leu Ser Tyr Val He Gly Thr Glu Val Pro Val Pro Gly 
165 170 175 



528 



SJSJt gag gcc age gcc att cag tea gta cac ate acc cat gtt gaa gat 
Gly Glu Ala Ser Ala He Gin Ser Val His He Thr His Val Glu Asp 
180 185 190 



576 
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gcc gcc aat act tta cgt acg cat caa aag gcc ttt att gcc cgt ggg 624 
Ala Ala Asn Thr Leu Arg Thr His Gin Lys Ala Phe He Ala Arg Gly 
195 200 205 

ctg aca gag gcg tta aca cgt gtg att gcc ate gtg gtg cag ccg ggt 672 
Leu Thr Glu Ala Leu Thr Arg Val He Ala He Val Val Gin Pro Gly 
210 215 220 

gtg gaa ttt gat cac age aat att ate cat tat cag ccg cag gaa gcg 720 
Val Glu Phe Asp His Ser Asn He He His Tyr Gin Pro Gin Glu Ala 
225 230 235 240 

cag ccg ctg gcg caa tgg at a gaa aac acc cga atg gtt tat gaa gca 768 
Gin Pro Leu Ala Gin Trp He Glu Asn Thr Arg Met Val Tyr Glu Ala 
245 250 255 

cat tct acc gat tac cag acc egg acg get tat tgg gaa tta gtc cgc 816 
His Ser Thr Asp Tyr Gin Thr Arg Thr Ala Tyr Trp Glu Leu Val Arg 
260 265 270 

gat cac ttt gca ata ttg aaa gtc ggt ccc gca tta acc ttt get tta 864 
Asp His Phe Ala He Leu Lys Val Gly Pro Ala Leu Thr Phe Ala Leu 
275 280 285 

cgc gag gcg ata ttt gca ctg gca caa att gag cag gaa ctt ate gcc 912 
Arg Glu Ala He Phe Ala Leu Ala Gin He Glu Gin Glu Leu He Ala 
290 295 300 

cct gaa aat cgc age ggt tgc ctg gcg gta att gaa gaa gtg atg etc 960 
Pro Glu Asn Arg Ser Gly Cys Leu Ala Val He Glu Glu Val Met Leu 
305 310 315 320 

gac gaa ccg caa tac tgg aaa aaa tat tat cgt acg ggt ttt aac gat 1008 
Asp Glu Pro Gin Tyr Trp Lys Lys Tyr Tyr Arg Thr Gly Phe Asn Asp 
325 330 335 

tea tta ctg gat att cgt tac age ctg teg gat cgt att cgt tat tac 1056 
Ser Leu Leu Asp He Arg Tyr Ser Leu Ser Asp Arg He Arg Tyr Tyr 
340 345 350 

tgg ccg cat agt egg att aaa aat age gtc gaa acg atg atg gtg aat 1104 
Trp Pro His Ser Arg He Lys Asn Ser Val Glu Thr Met Met Val Asn 
355 360 365 

ctt gaa ggc gtg gac ate cca ctg ggc atg att agt cag tat ctt ccc 1152 
Leu Glu Gly Val Asp He Pro Leu Gly Met He Ser Gin Tyr Leu Pro 
370 375 380 

aaa caa ttt gaa cgc att cag tec ggg gaa tta tea gca ata ccg cat 1200 
Lys Gin Phe Glu Arg He Gin Ser Gly Glu Leu Ser Ala He Pro His 
385 390 395 400 

cag ctg att atg gat aaa att tat gat gtt ttg cgc gcc tat cgc tac 1248 
Gin Leu He Met Asp Lys He Tyr Asp Val Leu Arg Ala Tyr Arg Tyr 
405 410 415 

ggc tgt gcg gaa taa 1263 
Gly Cys Ala Glu * 
420 
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<210> 233 
<211> 861 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(861) 

<400> 233 

atg aaa atg tac gtg gta teg aca aag cag atg ctg aac aac gca cag 48 
Met Lys Met Tyr Val Val Ser Thr Lys Gin Met Leu Asn Asn Ala Gin 
15 10 15 

cgc ggc ggt tat gcg gtt ccg gca ttc aat att cac aat etc gaa acg 96 
Arg Gly Gly Tyr Ala Val Pro Ala Phe Asn lie His Asn Leu Glu Thr 
20 25 30 

atg caa gtg gtg gta gaa acc get gec aac ctg cat gcg ccg gtc ate 144 
Met Gin Val Val Val Glu Thr Ala Ala Asn Leu His Ala Pro Val lie 
35 40 45 

ate gec gga acg cct ggc aca ttt act cat get ggt aca gaa aat ctg 192 
He Ala Gly Thr Pro Gly Thr Phe Thr His Ala Gly Thr Glu Asn Leu 
50 55 60 

tfc g 9 c g ct 9 gtc age gcg atg gcg aag caa tat cac cat cca ctg gca 240 
Leu Ala Leu Val Ser Ala Met Ala Lys Gin Tyr His His Pro Leu Ala 
65 70 75 80 

att cat etc gac cat cac acg aaa ttt gac gat ate get cag aag gtt 288 
He His Leu Asp His His Thr Lys Phe Asp Asp He Ala Gin Lys Val 
85 90 95 

cgt tct ggc gtg cgc tea gtc atg att gac gee teg cat ttg cct ttt 336 
Arg Ser Gly Val Arg Ser Val Met He Asp Ala Ser His Leu Pro Phe 
100 105 110 

gcg caa aat att tea egg gtc aaa gag gtg gtg gat ttt tgc cat cgc 384 
Ala Gin Asn He Ser Arg Val Lys Glu Val Val Asp Phe Cys His Arg 
115 120 125 

ttt gat gtc age gtc gag gcg gag ctg ggg caa ctt ggc ggc cag gaa 432 
Phe Asp Val Ser Val Glu Ala Glu Leu Gly Gin Leu Gly Gly Gin Glu 
130 135 140 

gat gat gtg caa gtc aat gaa gec gat gcg ttg tac acc aac ccc get 480 
Asp Asp Val Gin Val Asn Glu Ala Asp Ala Leu Tyr Thr Asn Pro Ala 
145 150 155 160 

cag gcg cgt gaa ttt gee gag gca acc gga att gat tec ctg gcg gtc 528 
Gin Ala Arg Glu Phe Ala Glu Ala Thr Gly He Asp Ser Leu Ala Val 
165 170 175 

gec ate ggc acg get cat ggg atg tat gee age gca ccg gcg ctt gat 576 
Ala He Gly Thr Ala His Gly Met Tyr Ala Ser Ala Pro Ala Leu Asp 
180 185 190 

ttt tct aga ctg gag aac att cgc cag tgg gtg aac tta ccg ctg gtg 624 
Phe Ser Arg Leu Glu Asn He Arg Gin Trp Val Asn Leu Pro Leu Val 
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195 200 205 

ctg cat ggc gcg tea ggg tta teg act aag gat att cag caa acc ate 672 
Leu His Gly Ala Ser Gly Leu Ser Thr Lys Asp He Gin Gin Thr lie 
210 215 220 

aaa ctg ggg ata tgc aaa ate aac gtt gca acg gag ctg aaa aat gee 720 
Lys Leu Gly He Cys Lys He Asn Val Ala Thr Glu Leu Lys Asn Ala 
225 230 235 240 

ttc teg cag gcg tta aaa aat tac ctg acc gag cac cct gaa gcg acc 768 
Phe Ser Gin Ala Leu Lys Asn Tyr Leu Thr Glu His Pro Glu Ala Thr 
245 250 255 

gat ccc egg gat tat ttg cag teg get aaa tec gca atg cgc gat gtg 816 
Asp Pro Arg Asp Tyr Leu Gin Ser Ala Lys Ser Ala Met Arg Asp Val 
260 265 270 

gtg age aaa gtg att gee gat tgt ggc tgc gag ggc agg gca taa 861 
Val Ser Lys Val He Ala Asp Cys Gly Cys Glu Gly Arg Ala * 
275 280 285 



<210> 234 
<211> 474 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (474) 

<400> 234 

atg agt caa aac gat ate att ate aga act cat tat aag tct cct cat 48 
Met Ser Gin Asn Asp He He He Arg Thr His Tyr Lys Ser Pro His 
1 5 10 .15 

aga ttg cac ate gat age gac ata cca aca cct tea tea gag cct att 96 
Arg Leu His He Asp Ser Asp He Pro Thr Pro Ser Ser Glu Pro He 
20 25 30 

aat caa ttt gcg cgc cag etc ate acc eta ctt gat acc tct gac tta 144 
Asn Gin Phe Ala Arg Gin Leu He Thr Leu Leu Asp Thr Ser Asp Leu 
35 40 " 45 

agt teg atg ctg tea tac tgt gtt act cag gaa ttt acc gca aac tgt 192 
Ser Ser Met Leu Ser Tyr Cys Val Thr Gin Glu Phe Thr Ala Asn Cys 
50 55 60 

cga aaa ata tea caa aat tgt tat tec act gee ctt ttt acc att aac 240 
Arg Lys He Ser Gin Asn Cys Tyr Ser Thr Ala Leu Phe Thr He Asn 
65 70 75 80 

ttt gee act tea ccc ate cat aca gaa aat ata etc att aca tta cac 288 
Phe Ala Thr Ser Pro He His Thr Glu Asn He Leu He Thr Leu His 
85 90 95 

tat aaa aaa gaa ate att tec tta tta ctg gaa acc acg cct att aaa 336 
Tyr Lys Lys Glu He He Ser Leu Leu Leu Glu Thr Thr Pro He Lys 
100 105 110 
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get aac cat ttg cga age ata ctg gat tat att gaa cag gaa cag tta 384 
Ala Asn His Leu Arg Ser He Leu Asp Tyr He Glu Gin Glu Gin Leu 
115 120 125 

act get gaa gat cgt aac cat tgt atg aaa ctg tct aaa aaa ate cat 432 
Thr Ala Glu Asp Arg Asn His Cys Met Lys Leu Ser Lys Lys He His 
130 135 140 



aga gaa aaa aac tat aca ccc aac agt aaa tct caa tgg tag 474 
Arg Glu Lys Asn Tyr Thr Pro Asn Ser Lys Ser Gin Trp * 
145 150 155 



<210> 235 
<211> 1416 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1416) 

<400> 235 

ttg caa tct cct tct gac get att ttt tgt cgc cat ctg tea ttg caa 48 
Met Gin Ser Pro Ser Asp Ala He Phe Cys Arg His Leu Ser Leu Gin 
15 10 15 

tac gee ctt gat tea ttg aga aat gga aaa ggc aaa gtc aac ctg att 96 
Tyr Ala Leu Asp Ser Leu Arg Asn Gly Lys Gly Lys Val Asn Leu He 
20 25 30 

aaa cat tac tec tec gtt gaa tec ata cag cag cat gtc ccc tta gtc 144 
Lys His Tyr Ser Ser Val Glu Ser He Gin Gin His Val Pro Leu Val 
35 40 45 

egg gac gcg gag ttc aga gca tta ctt cgc cat cct cct gca ggg agt 192 
Arg Asp Ala Glu Phe Arg Ala Leu Leu Arg His Pro Pro Ala Gly Ser 
50 55 60 

cgc gtt ate gcg agt aag gat ttt ggc ttc get tta gat att ttc ttt 240 
Arg Val He Ala Ser Lys Asp Phe Gly Phe Ala Leu Asp He Phe Phe 
65 70 75 BO 

tgt cga atg atg gca aac aat gtc agt cat atg tec gcg att tta tat 288 
Cys Arg Met Met Ala Asn Asn Val Ser His Met Ser Ala He Leu Tyr 
85 90 95 

ata gac aat cat act ttg tea gta agg eta cga ata aag cag tea gtg 336 
He Asp Asn His Thr Leu Ser Val Arg Leu Arg He Lys Gin Ser Val 
100 105 110 

tat gga caa tta aat tat gtt gtg tec gtt tac gac ccg aac gat acc 384 
Tyr Gly Gin Leu Asn Tyr Val Val Ser Val Tyr Asp Pro Asn Asp Thr 
115 120 125 

aac gtt gec gtc aga gac acc cac agg aca gca egg ggc ttt ctt teg 432 
Asn Val Ala Val Arg Asp Thr His Arg Thr Ala Arg Gly Phe Leu Ser 
130 135 140 
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ctt gat aag ttc ate agt tea ggt ccc gat get cag ace tgg get gat 480 
Leu Asp Lys Phe lie Ser Ser Gly Pro Asp Ala Gin Thr Trp Ala Asp 
145 150 155 160 

aga tat gtt cgc aac tgt gca att get att ctg ccc eta tta cct gtg 528 
Arg Tyr Val Arg Asn Cys Ala lie Ala lie Leu Pro Leu Leu Pro Val 
165 170 175 

gga gtt cca ggg get att ttc gcg ggt att gca tea cga atg cca ttt 576 
Gly Val Pro Gly Ala He Phe Ala Gly He Ala Ser Arg Met Pro Phe 
180 185 190 

gee cct ata cat cca teg gca atg ttg tta ata atg get aca ggc cag 624 
Ala Pro He His Pro Ser Ala Met Leu Leu He Met Ala Thr Gly Gin 
195 200 205 

tct caa cag ctt att aca tta ttc aaa cag ttg ccc ata etc cct gaa 672 
Ser Gin Gin Leu He Thr Leu Phe Lys Gin Leu Pro He Leu Pro Glu 
210 215 220 

aaa gaa ate att gaa ata ata act gcg cag aat age gtt ggt aca cct 720 
Lys Glu He He Glu He He Thr Ala Gin Asn Ser Val Gly Thr Pro 
225 230 235 240 

get tta ttt ttg get atg atg aac gga cat act gac aac gtg aaa ata 768 
Ala Leu Phe Leu Ala Met Met Asn Gly His Thr Asp Asn Val Lys He 
245 250 255 

ttt atg caa gaa att cag tea ctg gta gat aat cac ate att cat gaa 816 
Phe Met Gin Glu He Gin Ser Leu Val Asp Asn His He He His Glu 
260 265 270 

gat aat ctg gtt aaa tta ctg caa act aaa agt get aac gaa aca cct 864 
Asp Asn Leu Val Lys Leu Leu Gin Thr Lys Ser Ala Asn Glu Thr Pro 
275 280 285 

gga ctt tat ate tec atg ttg tat gga ttc gat gaa ata ate gat ate 912 
Gly Leu Tyr He Ser Met Leu Tyr Gly Phe Asp Glu He He Asp He 
290 295 300 

ttt ctg aat gca tta ace act cca ata gca caa gag ctt tta aac aaa 960 
Phe Leu Asn Ala Leu Thr Thr Pro He Ala Gin Glu Leu Leu Asn Lys 
305 310 315 320 

aaa ctg gtg atg agt att tta gee atg aaa ata cat gat ggt gag cca 1008 
Lys Leu Val Met Ser He Leu Ala Met Lys He His Asp Gly Glu Pro 
325 330 335 

gga tta tac gee gca atg gaa aat aat cac cct ttg tgt gtc aca egg 1056 
Gly Leu Tyr Ala Ala Met Glu Asn Asn His Pro Leu Cys Val Thr Arg 
340 345 350 

ttc etc tct aaa att aat ggc ate gee ttt aaa tac aag ttg age aaa 1104 
Phe Leu Ser Lys He Asn Gly He Ala Phe Lys Tyr Lys Leu Ser Lys 
355 360 365 

get aac ate atg gat tta tta aaa gga get aca gca cag gga ace cct 1152 
Ala Asn He Met Asp Leu Leu Lys Gly Ala Thr Ala Gin Gly Thr Pro 
370 375 380 

gca tta tac ate gee atg age aag ggt aat gaa gac gtc gtg tta tct 1200 
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Ala Leu Tyr lie Ala Met Ser Lys Gly Asn Glu Asp Val Val Leu Ser 
385 390 395 400 

tat ata teg acg ctg ggt get ttt gca aaa aaa cat tct ttt agt caa 1248 
Tyr lie Ser Thr Leu Gly Ala Phe Ala Lys Lys His Ser Phe Ser Gin 
405 410 415 

cat cag tta ttt aca eta ttg gec get aaa aat cat gac aac atg tea 1296 
His Gin Leu Phe Thr Leu Leu Ala Ala Lys Asn His Asp Asn Met Ser 
420 425 430 

get gtt cat ata gec att cat cat aag cat tat aaa act gta gaa aca 1344 
Ala Val His lie Ala lie His His Lys His Tyr Lys Thr Val Glu Thr 
435 440 445 

tat tat get get att aat gca ate age caa age ctg agt ttt agt get 1392 
Tyr Tyr Ala Ala lie Asn Ala lie Ser Gin Ser Leu Ser Phe Ser Ala 
450 455 460 

gat gaa ata aag acg tat tta taa 1416 
Asp Glu lie Lys Thr Tyr Leu * 
465 470 



<210> 236 
<211> 387 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . {387} 

<400> 236 

atg cca tct gga tta ttt atg gac tta ttg cct ttt tta ctg gac gcg 48 
Met Pro Ser Gly Leu Phe Met Asp Leu Leu Pro Phe Leu Leu Asp Ala 
15 10 15 

aac etc age gcg aca aac cca ccc get att ccg cac tgg tgg aag cgt 96 
Asn Leu Ser Ala Thr Asn Pro Pro Ala lie Pro His Trp Trp Lys Arg 
20 25 30 

caa ccg ctt att ccc aac ctt ctg tea cag gaa ctg aaa aac tat ctg 144 
Gin Pro Leu lie Pro Asn Leu Leu Ser Gin Glu Leu Lys Asn Tyr Leu 
35 40 45 

aag ctt aat gtt aaa gag aaa aat att cag att gca gac cag gta att 192 
Lys Leu Asn Val Lys Glu Lys Asn lie Gin lie Ala Asp Gin Val lie 
50 55 60 

att gat gaa act gca ggt gaa gtt gtt ate ggc gcg aat ace cgt att 240 
lie Asp Glu Thr Ala Gly Glu Val Val lie Gly Ala Asn Thr Arg lie 
65 70 75 80 

tgt cat ggt gee gtt att cag ggt ccg gta gtg att ggc gca aac tgc 288 
Cys His Gly Ala Val He Gin Gly Pro Val Val He Gly Ala Asn Cys 
85 90 95 

ctg ata ggt aat tgg att tgc ccc tat att tec aga cat ctg tta tea 336 
Leu He Gly Asn Trp He Cys Pro Tyr He Ser Arg His Leu Leu Ser 
100 105 no 
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ctt aac cca tta caa gcc cgc tgc cgc aga tat tec cgt ggc gag cga 384 
Leu Asn Pro Leu Gin Ala Arg Cys Arg Arg Tyr Ser Arg Gly Glu Arg 
115 120 125 

taa 387 
* 



<210> 237 
<211> 1197 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1197) 

<400> 237 

atg aaa acc tgg ata ttt ate tgt atg tec ata gca atg ttg eta tgg 48 
Met Lys Thr Trp lie Phe lie Cys Met Ser He Ala Met Leu Leu Trp 
1 5 " 10 15 

ttt tta agt acg eta aga cgt aaa ccc agt caa aag aaa ggc tgt att 96 
Phe Leu Ser Thr Leu Arg Arg Lys Pro Ser Gin Lys Lys Gly Cys He 
20 25 30 

gac gcc att ata cct gcg tat aac gaa ggc ccg tgt ctg gcg cag tea 144 
Asp Ala He He Pro Ala Tyr Asn Glu Gly Pro Cys Leu Ala Gin Ser 
35 40 45 

ctg gat aat eta ctg cgt aac cct tat ttt tgc egg gta att tgc gtt 192 
Leu Asp Asn Leu Leu Arg Asn Pro Tyr Phe Cys Arg Val He Cys Val 
50 55 60 

aac gac ggc tec acg gac aat acc gaa gcg gtc atg gcg gaa gtc aaa 240 
Asn Asp Gly Ser Thr Asp Asn Thr Glu Ala Val Met Ala Glu Val Lys 
65 70 75 80 

cgc aaa tgg ggc gac cgc ttt gtt gcc gtc acg caa aaa aat acc ggt 288 
Arg Lys Trp Gly Asp Arg Phe Val Ala Val Thr Gin Lys Asn Thr Gly 
85 90 95 

aaa ggt ggt gcg ctg atg aat ggc etc aat tac gcc acc tgc gac cag 336 
Lys Gly Gly Ala Leu Met Asn Gly Leu Asn Tyr Ala Thr Cys Asp Gin 
100 105 110 

gtt ttt tta agt gat gcc gac acc tat gtt ccg ccc gat caa gac gga 384 
Val Phe Leu Ser Asp Ala Asp Thr Tyr Val Pro Pro Asp Gin Asp Gly 
115 120 125 

atg ggc tat atg ctg gca gaa att gag cgc ggt gcc gat gcc gta ggc 432 
Met Gly Tyr Met Leu Ala Glu He Glu Arg Gly Ala Asp Ala Val Gly 
130 135 140 

ggc att ccc tct act gcg ttg aaa ggc gcg ggt ctg tta ccg cac ate 480 
Gly He Pro Ser Thr Ala Leu Lys Gly Ala Gly Leu Leu Pro His He 
145 150 155 160 

cgc gcg acc gta aag ttg ccg atg att gtt atg aag cgc acg eta cag 528 
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Arg Ala Thr Val Lys Leu Pro Met He Val Met Lys Arg Thr Leu Gin 
165 170 ~ 175 

cag etc ctg ggt ggc gca ccg ttt att ate age ggt gee tgc ggg atg 576 
Gin Leu Leu Gly Gly Ala Pro Phe He He Ser Gly Ala Cys Gly Met 
180 185 190 

ttc cgt act gat gta ttg cgt aag ttc ggt ttc teg gat cgt act aaa 624 
Phe Arg Thr Asp Val Leu Arg Lys Phe Gly Phe Ser Asp Arg Thr Lys 
195 200 205 

gtc gaa gac ctt gat etc ace tgg aca ttg gtg gca aac ggc tac cgt 672 
Val Glu Asp Leu Asp Leu Thr Trp Thr Leu Val Ala Asn Gly Tyr Arg 
210 215 220 

att egg cag gcg aat cgc tgc ate gta tac cca cag gaa tgc aac age 720 
He Arg Gin Ala Asn Arg Cys He Val Tyr Pro Gin Glu Cys Asn Ser 
225 230 235 240 

ccg cgt gag gag tgg cgt cgc tgg egg cgt tgg att gtg gga tac gcg 768 
Pro Arg Glu Glu Trp Arg Arg Trp Arg Arg Trp He Val Gly Tyr Ala 
245 250 255 

gtc tgt atg cgc ctg cat aaa aga ctt tta ttt age cgc ttc ggt ate 816 
Val Cys Met Arg Leu His Lys Arg Leu Leu Phe Ser Arg Phe Gly He 
260 265 270 

ttc agt ata ttt cct atg ctg ttg gtt gtg ctt tat ggc gtt ggg att 864 
Phe Ser He Phe Pro Met Leu Leu Val Val Leu Tyr Gly Val Gly He 
275 280 285 

tat etc act ace tgg ttt aat gaa ttc ate acc acc ggg ccg cat gga 912 
Tyr Leu Thr Thr Trp Phe Asn Glu Phe He Thr Thr Gly Pro His Gly 
290 295 300 

gtg gtg ttg gca atg ttt ccg ctt ate tgg gtc ggc gta gtt tgt gtt 960 
Val Val Leu Ala Met Phe Pro Leu He Trp Val Gly Val Val Cy3 Val 
305 310 315 320 

att ggt get ttt age gee tgg ttt cat cgt tgc tgg ttg ttg gtg cct 1008 
He Gly Ala Phe Ser Ala Trp Phe His Arg Cys Trp Leu Leu Val Pro 
325 330 335 

tta gcg ccg ctt tec gtt gtg tat gta tta tta get tat gee ate tgg 1056 
Leu Ala Pro Leu Ser Val Val Tyr Val Leu Leu Ala Tyr Ala He Trp 
340 345 350 

att att tat gga ctt att gee ttt ttt act gga cgc gaa cct cag cgc 1104 
He He Tyr Gly Leu He Ala Phe Phe Thr Gly Arg Glu Pro Gin Arg 
355 360 365 

gac aaa ccc acc cgc tat tec gca ctg gtg gaa gcg tea acc get tat 1152 
Asp Lys Pro Thr Arg Tyr Ser Ala Leu Val Glu Ala Ser Thr Ala Tyr 
370 375 380 

tec caa cct tct gtc aca gga act gaa aaa eta tct gaa get taa 1197 
Ser Gin Pro Ser Val Thr Gly Thr Glu Lys Leu Ser Glu Ala * 
385 390 395 
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<210> 238 
<211> 1659 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (l) . . . (1659) 

<400> 238 

atg att ctt gag cgc gtt gaa att gtg ggt ttt cgc ggt ate aac cgt 48 
Met lie Leu Glu Arg Val Glu He Val Gly Phe Arg Gly He Asn Arg 
15 10 15 

ttg teg ttg atg ctg gaa caa aac aac gtc ctg att ggg gag aac gcg 96 
Leu Ser Leu Met Leu Glu Gin Asn Asn Val Leu He Gly Glu Asn Ala 
20 25 30 

tgg ggt aaa tec age ttg ctg gac gee tta act ctg ctg eta teg cca 144 
Trp Gly Lys Ser Ser Leu Leu Asp Ala Leu Thr Leu Leu Leu Ser Pro 
35 40 45 

gaa tea gat etc tac cat ttt gag cgc gac gat ttc tgg ttc ccg ccg 192 
Glu Ser Asp Leu Tyr His Phe Glu Arg Asp Asp Phe Trp Phe Pro Pro 
50 55 60 

gga gat ate aac ggg cga gaa cat cat ctg cat att att ttg acc ttc 240 
Gly Asp He Asn Gly Arg Glu His His Leu His He He Leu Thr Phe 
65 70 75 80 

cgc gaa teg ctg cca ggc cga cat egg gtt cgc cgt tat egg ccg ctg 288 
Arg Glu Ser Leu Pro Gly Arg His Arg Val Arg Arg Tyr Arg Pro Leu 
85 90 95 

gaa gcg tgc tgg acg cca tgc acc gat ggc tat cac cgt att ttt tat 336 
Glu Ala Cys Trp Thr Pro Cys Thr Asp Gly Tyr His Arg He Phe Tyr 
100 105 110 

cgt ctg gaa ggg gag agt gcg gaa gac ggc age gtg atg aca ctg cgc 384 
Arg Leu Glu Gly Glu Ser Ala Glu Asp Gly Ser Val Met Thr Leu Arg 
115 120 125 

agt ttt etc gat aaa gac gga cat ccg att gat gtc gag gat att aac 432 
Ser Phe Leu Asp Lys Asp Gly His Pro He Asp Val Glu Asp He Asn 
130 135 140 

gat cag gca cgc cat ctg gtg cgt tta atg ccg gtg ctg cgc ttg cgt 480 
Asp Gin Ala Arg His Leu Val Arg Leu Met Pro Val Leu Arg Leu Arg 
145 150 155 160 

gat gec cgt ttt atg cgc cgt att cgt aac ggc acg gtg cca aat gtc 528 
Asp Ala Arg Phe Met Arg Arg He Arg Asn Gly Thr Val Pro Asn Val 
165 170 175 

cct aat gtg gaa gtc acc gcg cgc cag etc gat ttc etc gee cgt gag 576 
Pro Asn Val Glu Val Thr Ala Arg Gin Leu Asp Phe Leu Ala Arg Glu 
180 185 190 

tta tec tea cat ccg caa aat etc tct gat ggg cag att cgt cag gga 624 
Leu Ser Ser His Pro Gin Asn Leu Ser Asp Gly Gin He Arg Gin Gly 
195 200 205 
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ctt tec gca atg gta cag ctg ctt gag cat tat ttc tct gag cag ggg 672 
Leu Ser Ala Met Val Gin Leu Leu Glu His Tyr Phe Ser Glu Gin Gly 
210 215 220 

gec gga cag gcg cga tat cgt tta atg egg egg cga gee age aat gag 720 
Ala Gly Gin Ala Arg Tyr Arg Leu Met Arg Arg Arg Ala Ser Asn Glu 
225 230 235 240 

caa cga age tgg cgc tat ctg gat ate ate aac egg atg att gac cga 768 
Gin Arg Ser Trp Arg Tyr Leu Asp lie He Asn Arg Met He Asp Arg 
245 250 255 

cct ggt ggg cgc teg tat egg gtt att ttg etc ggc eta ttt get act 816 
Pro Gly Gly Arg Ser Tyr Arg Val He Leu Leu Gly Leu Phe Ala Thr 
260 265 270 

ttg ttg cag gca aaa ggc aca ttg cga ctg gat aaa gac gee cgt cca 864 
Leu Leu Gin Ala Lys Gly Thr Leu Arg Leu Asp Lys Asp Ala Arg Pro 
275 280 285 

ttg ttg ctg ate gaa gat cca gaa acc cgt tta cac ccc att atg ctt 912 
Leu Leu Leu He Glu Asp Pro Glu Thr Arg Leu His Pro He Met Leu 
290 295 300 

tea gtt gee tgg cat ctg ttg aat ctt ctg cca ttg cag cgc att gee 960 
Ser Val Ala Trp His Leu Leu Asn Leu Leu Pro Leu Gin Arg He Ala 
305 310 315 320 

acc acc aac teg ggt gag ttg ctt teg tta acg ccg gta gag cat gtt 1008 
Thr Thr Asn Ser Gly Glu Leu Leu Ser Leu Thr Pro Val Glu His Val 
325 330 335 

tgc cga ctg gta cgt gag tec teg cgc gtt gee gec tgg cgt ctg ggg 1056 
Cys Arg Leu Val Arg Glu Ser Ser Arg Val Ala Ala Trp Arg Leu Gly 
340 345 350 

ccg agt ggc ttg agt acc gaa gat age cga cgc ata tec ttt cac att 1104 
Pro Ser Gly Leu Ser Thr Glu Asp Ser Arg Arg He Ser Phe His He 
355 360 365 

cgt ttt aac cgt ccg tea teg ctg ttt gca cgc tgc tgg ttg ctg gtg 1152 
Arg Phe Asn Arg Pro Ser Ser Leu Phe Ala Arg Cys Trp Leu Leu Val 
370 375 380 

gaa ggg gaa acg gaa acc tgg gtt ate aat gaa ctg gcg cgt cag tgc 1200 
Glu Gly Glu Thr Glu Thr Trp Val He Asn Glu Leu Ala Arg Gin Cys 
385 390 395 400 

gga cat cat ttt gat gee gaa ggg ate aag gtc att gag ttt gec cag 1248 
Gly His His Phe Asp Ala Glu Gly He Lys Val He Glu Phe Ala Gin 
405 410 415 

tec ggg eta aag cca ctg gtt aaa ttt gee cgc cga atg ggg att gaa 1296 
Ser Gly Leu Lys Pro Leu Val Lys Phe Ala Arg Arg Met Gly He Glu 
420 425 430 

tgg cat gta ctg gtc gat ggc gat gaa gca ggg aag aaa tat gee get 1344 
Trp His Val Leu Val Asp Gly Asp Glu Ala Gly Lys Lys Tyr Ala Ala 
435 440 445 
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acg gta cgc age ctg ttg aat aac gat egg gaa gee gaa cga gaa cat 1392 
Thr Val Arg Ser Leu Leu Asn Asn Asp Arg Glu Ala Glu Arg Glu His 
450 455 460 

tta acg gcg tta ccg gcg ctg gat atg gaa cat ttt atg tat cgc cag 1440 
Leu Thr Ala Leu Pro Ala Leu Asp Met Glu His Phe Met Tyr Arg Gin 
465 470 475 480 

gga ttt tec gat gtg ttc cac cgc atg gcg caa ate ccg gaa aat gta 1488 
Gly Phe Ser Asp Val Phe His Arg Met Ala Glu lie Pro Glu Asa Val 
485 490 495 

ccg atg aat eta cgc aaa att ate teg aaa gcg ate cat cgc tct tec 1536 
Pro Met Asn Leu Arg Lys lie lie Ser Lys Ala lie His Arg Ser Ser 
500 505 510 

aaa ccc gat ctt gee att gaa gtg gca atg gag gca gga cgt cgt ggt 1584 
Lys Pro Asp Leu Ala He Glu Val Ala Met Glu Ala Gly Arg Arg Gly 
515 520 525 

gtg gac tec gta ccg acg ctg ctg aaa aaa atg ttc tea cgc gtg ctg 1632 
Val Asp Ser Val Pro Thr Leu Leu Lys Lys Met Phe Ser Arg Val Leu 
530 . 535 540 



tgg ctg gcg cgc ggt cgc gcg gat taa 1659 
Trp Leu Ala Arg Gly Arg Ala Asp * 
545 550 



<210> 239 
<211> 1059 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1059) 

<400> 239 

atg eta cca tct att tea ate aac aat ace age gca get tac cca gaa 48 

Met Leu Pro Ser He Ser He Asn Asn Thr Ser Ala Ala Tyr Pro Glu 
1 5 10 15 

tec ate aat gaa aat aac aat gat gaa gtt aat gga tta gta caa gag 96 
Ser He Asn Glu Asn Asn Asn Asp Glu Val Asn Gly Leu Val Gin Glu 
20 25 30 

ttc aaa aac ctt ttt aat ggt aag gaa gga ata age acc tgt att aaa 144 
Phe Lys Asn Leu Phe Asn Gly Lys Glu Gly He Ser Thr Cys He Lys 
35 40 45 

cat eta qtt gag ctt ata aaa aac gee ata cga gta aac gac gat cct 192 
His Leu Leu Glu Leu He Lys Asn Ala He Arg Val Asn Asp Asp Pro 
50 .55 60 

tat aga ttt aat att aat aat tec tea gtt act tat att gat att gac 240 
Tyr Arg Phe Asn He Asn Asn Ser Ser Val Thr Tyr He Asp He Asp 
65 70 75 80 

tec aat gat aca gac cat att act att ggt ate gac aac caa gaa cca 288 
Ser Asn Asp Thr Asp His He Thr He Gly He Asp Asn Gin Glu Pro 
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85 90 95 

ata gaa tta cct gcg aac tat aaa gac aaa gaa etc gtc cgt act ate 336 
He Glu Leu Pro Ala Asn Tyr Lys Asp Lys Glu Leu Val Arg Thr He 
100 105 110 

att aat gac aac ata gtt gag aag act cat gat ate aat aac aag gaa 384 
He Asn Asp Asn He Val Glu Lys Thr His Asp He Asn Asn Lys Glu 
115 120 125 

atg ate ttc age gca tta aaa gaa ata tat gat gga gat cct ggt ttt 432 
Met He Phe Ser Ala Leu Lys Glu He Tyr Asp Gly Asp Pro Gly Phe 
130 135 140 

ate ttc gat aag ata tea cac aaa etc aga cat acg gta acg gaa ttt 480 
He Phe Asp Lys He Ser His Lys Leu Arg His Thr Val Thr Glu Phe 
145 150 155 160 

gat gag age ggg aaa age gaa cca acg gac tta ttt ace tgg tac ggt 528 
Asp Glu Ser Gly Lys Ser Glu Pro Thr Asp Leu Phe Thr Trp Tyr Gly 
165 170 175 

aaa gat aaa aaa ggc gac tct etc get att gta att aaa aat aaa aac 576 
Lys Asp Lys Lys Gly Asp Ser Leu Ala He Val He Lys Asn Lys Asn 
180 185 190 

gga aat gat tac tta tct etc ggt tac tac gat cag gac gac tac cac 624 
Gly Asn Asp Tyr Leu Ser Leu Gly Tyr Tyr Asp Gin Asp Asp Tyr His 
195 . 200 205 

att caa aga gga att cgt att aat ggt gat agt etc acc caa tat tgt 672 
He Gin Arg Gly He Arg He Asn Gly Asp Ser Leu Thr Gin Tyr Cys 
210 215 220 

agt gaa aac gec agg agt get tea gcg tgg ttt gaa age agt aaa get 720 
Ser Glu Asn Ala Arg Ser Ala Ser Ala Trp Phe Glu Ser Ser Lys Ala 
225 230 235 240 

ate atg gca gaa tea ttt gca act ggt tec gat cat cag gtt gta aac 768 
He Met Ala Glu Ser Phe Ala Thr Gly Ser Asp His Gin Val Val Asn 
245 250 255 

gag etc aac ggg gaa aga ctg aga gaa cca aac gac gtt ttt aaa cgt 816 
Glu Leu Asn Gly Glu Arg Leu Arg Glu Pro Asn Asp Val Phe Lys Arg 
260 265 270 

tat ggt cga gca ata aga tat gat ttt caa gtg gac gat gca aaa tat 864 
Tyr Gly Arg Ala He Arg Tyr Asp Phe Gin Val Asp Asp Ala Lys Tyr 
275 280 285 

aaa tgc gac cat eta aaa gaa ata gtt tct act tta gtc ggt aac aaa 912 
Lys Cys Asp His Leu Lys Glu He Val Ser Thr Leu Val Gly Asn Lys 
290 295 300 

att aac gtt ggc cat tct caa aaa ata tat aag cat ttt aag gat etc 960 
He Asn Val Gly His Ser Gin Lys He Tyr Lys His Phe Lys Asp Leu 
305 310 315 320 

gaa ggt aaa att gaa gaa agg ctt caa aat cgc cag get gaa tat caa 1008 
Glu Gly Lys He Glu Glu Arg Leu Gin Asn Arg Gin Ala Glu Tyr Gin 
325 330 335 
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aat gaa att aat caa cca tct gcg cca ggt gtt aat ttt gat gat att 1056 
Asn Glu lie Asn Gin Pro Ser Ala Pro Gly Val Asn Phe Asp Asp lie 
340 345 350 

taa 1059 



<210> 240 
<211> 1128 
<2i2> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1128) 

<400> 240 

atg atg cgc cat tta cgc aat att ttt aat ctg ggt ate aaa gag ttg 48 
Met Met Arg His Leu Arg Asn lie Phe Asn Leu Gly lie Lys Glu Leu 
1 5 10 15 

cgc agt ctg etc ggt gat aaa gcg atg ctg acg ctg att gtc ttc teg 96 
Arg Ser Leu Leu Gly Asp Lys Ala Met Leu Thr Leu lie Val Phe Ser 
20 25 30 

ttt acg gtg teg gtg tat teg tea gcg ace gtt acg cca gga teg ttg 144 
Phe Thr Val Ser Val Tyr Ser Ser Ala Thr Val Thr Pro Gly Ser Leu 
35 40 45 

aac etc gcg ccg ate gee att gec gat atg gat caa teg cag tta teg 192 
Asn Leu Ala Pro He Ala He Ala Asp Met Asp Gin Ser Gin Leu Ser 
50 55 60 

aac egg ate gtt aac age ttc tat cgt ccg tgg ttt ttg cca ccg gag 240 
Asn Arg He Val Asn Ser Phe Tyr Arg Pro Trp Phe Leu Pro Pro Glu 
65 70 75 80 

atg ate acc gee gat gag atg gat gee gga ctg gac gee gga cgc tat 288 
Met He Thr Ala Asp Glu Met Asp Ala Gly Leu Asp Ala Gly Arg Tyr 
85 90 95 

acc ttc gcg ata aat att ccg cct aat ttt cag cgt gat gtc etc gee 336 
Thr Phe Ala He Asn He Pro Pro Asn Phe Gin Arg Asp Val Leu Ala 
100 105 110 

gga cgc cag ccg gat att cag gtg aac gtc gat gee acg cgc atg age 384 
Gly Arg Gin Pro Asp He Gin Val Asn Val Asp Ala Thr Arg Met Ser 
115 120 125 

cag gca ttt acc ggc aat ggg tat ate cag aat att ate aac ggt gaa 432 
Gin Ala Phe Thr Gly Asn Gly Tyr He Gin Asn He He Asn Gly Glu 
130 135 140 

gtg aac age ttt gtc gcg cgc tac cgt gat aac age gaa ccg ttg gta 4B0 
Val Asn Ser Phe Val Ala Arg Tyr Arg Asp Asn Ser Glu Pro Leu Val 
145 150 155 160 

teg ctg gaa acc egg atg cgc ttt aac ccg aac etc gat ccc gcg tgg 528 
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Ser Leu Glu Thr Arg Met Arg Phe Asn Pro Asn Leu Asp Pro Ala Trp 
165 170 175 

ttt ggc ggg gtg atg gcg ate ate aac aac att acc atg ctg gcg att 576 
Phe Gly Gly Val Met Ala lie He Asn Asn He Thr Met Leu Ala He 
180 IBS 190 

gta ttg acc gga teg gcg ctg ate cgc gag cgt gaa cac ggc acg gtg 624 
Val Leu Thr Gly Ser Ala Leu He Arg Glu Arg Glu His Gly Thr Val 
195 200 205 

gaa cac tta ctg gtg atg ccg ata acg ccg ttt gag ate atg atg gcg 672 
Glu His Leu Leu Val Met Pro He Thr Pro Phe Glu He Met Met Ala 
210 215 220 

aag ate tgg teg atg ggg ctg gtg gtg ctg gtg gta teg gga tta teg 720 
Lys He Trp Ser Met Gly Leu Val Val Leu Val Val Ser Gly Leu Ser 
225 230 235 240 

ctg gtg ctg atg gtg aaa ggt gta ctg ggc gta ccg att gaa ggc teg 768 
Leu Val Leu Met Val Lys Gly Val Leu Gly Val Pro He Glu Gly Ser 
245 250 255 

ate ccg ctg ttt atg ctg ggc gtg gcg etc agt ctg ttt gee acc acg 816 
He Pro Leu Phe Met Leu Gly Val Ala Leu Ser Leu Phe Ala Thr Thr 
260 265 270 

tea ate ggc att ttt atg ggg acg ata gcg cgt tea atg ccg caa ctg 864 
Ser He Gly He Phe Met Gly Thr He Ala Arg Ser Met Pro Gin Leu 
275 280 285 

ggg ctg ctg gtg att ctg gtg ctg ctg ccg ctg caa atg ctt tec ggt 912 
Gly Leu Leu Val He Leu Val Leu Leu Pro Leu Gin Met Leu Ser Gly 
290 295 300 

ggt tec acg ccg cgc gaa agt atg ccg cag atg gtg cag gac att atg 960 
Gly Ser Thr Pro Arg Glu Ser Met Pro Gin Met Val Gin Asp He Met 
305 310 315 320 

ctg acc atg ccg acg aca cac ttt gtt age etc gcg cag gee ate etc 1008 
Leu Thr Met Pro Thr Thr His Phe Val Ser Leu Ala Gin Ala He Leu 
325 330 335 

tac egg ggt gee gga ttc gaa ate gtc tgg ccg cag ttt ctg acg ctg 1056 
Tyr Arg Gly Ala Gly Phe Glu He Val Trp Pro Gin Phe Leu Thr Leu 
340 345 350 

a tg gca att ggc ggc gca ttt ttc acc att gcg ctg ctg cga ttc agg 1104 
Met Ala He Gly Gly Ala Phe Phe Thr He Ala Leu Leu Arg Phe Arg 
355 360 365 

aag acg att ggg aca atg gcg taa 1128 
Lys Thr He Gly Thr Met Ala * 
370 375 



<210> 241 
<211> 2685 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> CDS 

<222> (1)...(2685) 

<400> 241 

gtg age cag cat tat gga aaa acc gtt gcg ctg aac aat ate act etc 48 
Met Ser Gin His Tyr Gly Lys Thr Val Ala Leu Asn Asn lie Thr Leu 
15 10 15 

gat att ccg gee cgc tgt atg gtc ggg ctg att ggc ccg gac ggc gtc 96 
Asp lie Pro Ala Arg Cys Met Val Gly Leu lie Gly Pro Asp Gly Val 
20 25 30 

ggg aag teg age ttg ttg teg ttg att tec ggt gee cgc gtc att gaa 144 
Gly Lys Ser Ser Leu Leu Ser Leu lie Ser Gly Ala Arg Val lie Glu 
35 40 45 

cag ggc aat gtg atg gtg ctg ggc ggc gat atg cgc gac ccg aag cat 192 
Gin Gly Asn Val Met Val Leu Gly Gly Asp Met Arg Asp Pro Lys His 
50 55 60 

cgc cgc gac gtc tgc ccg cgc ate gee tgg atg ccg cag ggg ctg ggc 240 
Arg Arg Asp Val Cys Pro Arg lie Ala Trp Met Pro Gin Gly Leu Gly 
65 70 75 80 

aaa aac etc tac cac acc ttg teg gtg tat gaa aac gtc gat ttt ttc 288 
Lys Asn Leu Tyr His Thr Leu Ser Val Tyr Glu Asn Val Asp Phe Phe 
85 90 95 

get cgc ctg ttc ggt cac gac aaa gcg gag egg gaa gtg cga ate aat 336 
Ala Arg Leu Phe Gly His Asp Lys Ala Glu Arg Glu Val Arg lie Asn 
100 105 110 

gag ctg ctg acc age acc ggg tta gca ccg ttt cgc gat cgt ccg gca 384 
Glu Leu Leu Thr Ser Thr Gly Leu Ala Pro Phe Arg Asp Arg Pro Ala 
115 120 125 

ggg aaa etc tec ggc ggg atg aag caa aaa ctt ggg ctg tgc tgc gcg 432 
Gly Lys Leu Ser Gly Gly Met Lys Gin Lys Leu Gly Leu Cys Cys Ala 
130 135 140 

tta ate cac gac ccg gaa ctg ttg ate ctt gat gag cca aca acg ggg 480 
Leu lie His Asp Pro Glu Leu Leu lie Leu Asp Glu Pro Thr Thr Gly 
145 150 155 160 

gtt gac ccg etc tec cgc tec cag ttc tgg gat ctg ate gac agt att 528 
Val Asp Pro Leu Ser Arg Ser Gin Phe Trp Asp Leu lie Asp Ser lie 
165 170 175 

cgc cag egg cag age aat atg age gtg ctg gtc gee acc gec tat atg 576 
Arg Gin Arg Gin Ser Asn Met Ser Val Leu Val Ala Thr Ala Tyr Met 
180 185 190 

gaa gag gee gaa cgc ttc gac tgg ctg gta gcg atg aat gee gga gaa 624 
Glu Glu Ala Glu Arg Phe Asp Trp Leu Val Ala Met Asn Ala Gly Glu 
195 200 205 

gtg ctg gca act ggc age gee gaa gag eta egg cag caa acg caa age 672 
Val Leu Ala Thr Gly Ser Ala Glu Glu Leu Arg Gin Gin Thr Gin Ser 
210 215 220 
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get acg ctg gaa gaa gca ttt ata aat ctg tta ccg caa gcg caa cgc 720 
Ala Thr Leu Glu Glu Ala Phe lie Asn Leu Leu Pro Gin Ala Gin Arg 
225 230 235 240 

cag gcg cat cag gcg gta gtg ate cca ccg tat caa cct gaa aac gca 768 
Gin Ala His Gin Ala Val Val lie Pro Pro Tyr Gin Pro Glu Asn Ala 
245 250 255 

gag att gec ate gaa gcg cgc gat ctg acc atg cgt ttt ggt tec ttc 816 
Glu lie Ala lie Glu Ala Arg Asp Leu Thr Met Arg Phe Gly Ser Phe 
260 265 270 

gtt gec gtt gat cac gtt aat ttc cgc att cca cgc ggg gag att ttt 864 
Val Ala Val Asp His Val Asn Phe Arg lie Pro Arg Gly Glu lie Phe 
275 280 285 

ggt ttt ctt ggt teg aac ggc tgc ggt aaa tec acc acc atg aaa atg 912 
Gly Phe Leu Gly Ser Asn Gly Cys Gly Lys Ser Thr Thr Met Lys Met 
290 295 300 

etc acc gga ctg ctg ccc gec age gaa ggt gag gcg tgg ctg ttc ggg 960 
Leu Thr Gly Leu Leu Pro Ala Ser Glu Gly Glu Ala Trp Leu Phe Gly 
305 310 315 320 

caa ccg gtt gat cca aaa gat ate gat acc cgc cgt egg gtg ggc tat 1008 
Gin Pro Val Asp Pro Lys Asp lie Asp Thr Arg Arg Arg Val Gly Tyr 
325 330 335 

atg teg cag gcg ttt teg etc tat aac gaa etc acc gtg egg caa aac 1056 
Met Ser Gin Ala Phe Ser Leu Tyr Asn Glu Leu Thr Val Arg Gin Asn 
340 345 350 

ctt gag tta cat gee cgt ttg ttt cac ate ccg gaa gcg gaa att ccc 1104 
Leu Glu Leu His Ala Arg Leu Phe His lie Pro Glu Ala Glu lie Pro 
355 360 365 

gca aga gtg get gaa atg age gag cgt ttt aag etc aac gac gtt gaa 1152 
Ala Arg Val Ala Glu Met Ser Glu Arg Phe Lys Leu Asn Asp Val Glu 
■■ 370 375 380 

gat att ctg ccg gag tea ttg ccg etc ggc att cgc cag egg ctt teg 1200 
Asp He Leu Pro Glu Ser Leu Pro Leu Gly He Arg Gin Arg Leu Ser 
385 390 395 400 

ctg gcg gtg gcg gtg att cat cgc ccg gag atg tta ate etc gat gag 1248 
Leu Ala Val Ala Val He His Arg Pro Glu Met Leu He Leu Asp Glu 
405 410 415 

cct act tct ggt gtc gat ccg gtg gcg agg gat atg ttc tgg cag ttg 1296 
Pro Thr Ser Gly Val Asp Pro Val Ala Arg Asp Met Phe Trp Gin Leu 
420 425 430 

atg gtc gat etc teg cgc cag gac aaa gtg act ate ttc ate tec acc 1344 
Met Val Asp Leu Ser Arg Gin Asp Lys Val Thr He Phe He Ser Thr 
435 440 445 

cac ttt atg aac gaa gcg gaa cgt tgc gac cgc ate tea ctg atg cac 1392 
His Phe Met Asn Glu Ala Glu Arg Cys Asp Arg He Ser Leu Met His 
450 455 460 

gec gga aaa gtg ctt gec age ggt aca ccg cag gaa ctg gtt gag aaa 1440 
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Ala Gly Lys Val lieu Ala Ser Gly Thr Pro Gin Glu Leu Val Glu Lys 
465 470 475 480 

cgc gga gcc gcc agt ctg gaa gag gca ttt ate gec tat ttg cag gaa 1488 
Arg Gly Ala Ala Ser Leu Glu Glu Ala Phe lie Ala Tyr Leu Gin Glu 
485 490 495 

gcg gca ggg cag age aac gaa gcc gaa gcg ccg ccc gtg gta cac gac 1536 
Ala Ala Gly Gin Ser Asn Glu Ala Glu Ala Pro Pro Val Val His Asp 
500 505 510 

acc acc cac gcg ccg cgt cag gga ttt age ctg cgc cgt ctg ttt age 1584 
Thr Thr His Ala Pro Arg Gin Gly Phe Ser Leu Arg Arg Leu Phe Ser 
515 520 525 

tac age cgc cgc gaa gcg ctg gaa ctg cga cgc gat cca gta cgt teg 1632 
Tyr Ser Arg Arg Glu Ala Leu Glu Leu Arg Arg Asp Pro Val Arg Ser 
530 535 540 

acg ctg gcg ctg atg gga acg gtg ate ctg atg ctg ata atg ggt tac 1680 
Thr Leu Ala Leu Net Gly Thr Val He Leu Met Leu He Met Gly Tyr 
545 550 555 560 

ggc ate agt atg gat gtg gaa aac ctg cgc ttt gcg gtg etc gac cgc 1728 
Gly He Ser Met Asp Val Glu Asn Leu Arg Phe Ala Val Leu Asp Arg 
565 570 575 

gac cag acc gtc agt age cag gcg tgg aca etc aac etc tec ggt tec 1776 
Asp Gin Thr Val Ser Ser Gin Ala Trp Thr Leu Asn Leu Ser Gly Ser 
580 585 590 

cgt tac ttt ate gaa cag ccg ccg etc acc agt tat gac gag ctt gat 1824 
Arg Tyr Phe He Glu Gin Pro Pro Leu Thr Ser Tyr Asp Glu Leu Asp 
595 600 605 

cgt egg atg cgt gcg ggc gat ate acg gtg gcg att gag ate ccg ccc 1872 
Arg Arg Met Arg Ala Gly Asp He Thr Val Ala He Glu He Pro Pro 
610 615 620 

aat ttc ggg cgc gat ate gcg cgt ggt acg cct gtg gaa etc ggc gtc 1920 
Asn Phe Gly Arg Asp He Ala Arg Gly Thr Pro Val Glu lieu Gly Val 
625 630 635 640 

tgg ate gac gga gcg atg ccg age cgt get gaa acg gta aaa ggt tac 1968 
Trp He Asp Gly Ala Met Pro Ser Arg Ala Glu Thr Val Lys Gly Tyr 
645 650 655 

gtg cag gcc atg cac cag age tgg tta cag gat gtg gcg age cga caa 2016 
Val Gin Ala Met His Gin Ser Trp Leu Gin Asp Val Ala Ser Arg Gin 
660 665 670 

teg aca ccc gcc age caa age ggg ctg atg aat att gag acg cgc tat 2064 
Ser Thr Pro Ala Ser Gin Ser Gly Leu Met Asn He Glu Thr Arg Tyr 
675 680 685 

cgc tat aac ccg gac gta aaa age ctg cca gcg att gtt ccg gcg gtg 2112 
Arg Tyr Asn Pro Asp Val Lys Ser Leu Pro Ala He Val Pro Ala Val 
690 695 700 

ate ccg ctt ctg ctg atg atg ate ccg tea atg eta age gcc ctt age 2160 
He Pro Leu Leu Leu Met Met He Pro Ser Met Leu Ser Ala Leu Ser 
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705 710 715 720 

gtg gtg egg gaa aaa gag ctt ggg teg att ate aac ctt tac gtg acc 2208 

Val Val Arg Glu Lys Glu Leu Gly Ser lie lie Asn Leu Tyr Val Thx 

725 730 735 

ccc acc acg cgt agt gaa ttt ttg ctt ggt aaa cag ttg cca tac ate 2256 
Pro Thr Thr Arg Ser Glu Phe Leu Leu Gly Lys Gin Leu Pro Tyr lie 
740 745 750 

gcg ctg ggg atg ctg aac ttt ttc ctg etc tgc ggc ctg teg gtg ttt 2304 

Ala Leu Gly Met Leu Asn Phe Phe Leu Leu Cys Gly Leu Ser Val Phe 
755 760 765 

gtg ttt ggc gta ccg cat aaa ggc agt ttc ctg acg etc acc ctg gcg 2352 

Val Phe Gly Val Pro His Lys Gly Ser Phe Leu Thr Leu Thr Leu Ala 
770 775 780 

gcg ctg ctg tat ate ate att gee acc gga atg ggg ctg ctg ate tec 2400 

Ala Leu Leu Tyr lie lie lie Ala Thr Gly Met Gly Leu Leu lie Ser 
785 790 795 800 

acc ttt atg aaa age cag att gee gee att ttc gga acg gcg att ate 2448 

Thr Phe Met Lys Ser Gin lie Ala Ala lie Phe Gly Thr Ala lie lie 

805 810 815 

acg ttg ate ccg gcg aca cag ttt tec ggg atg ate gat ccg gta get 2496 
Thr Leu lie Pro Ala Thr Gin Phe Ser Gly Met lie Asp Pro Val Ala 
820 825 830 

teg ctg gaa ggg cct gga cgt tgg ate ggc gag gtt tac ccg acc agt 2544 

Ser Leu Glu Gly Pro Gly Arg Trp lie Gly Glu Val Tyr Pro Thr Ser 
835 840 845 



cat ttt ctg act ate gee cgc ggg acg ttc teg aaa gcg ctg gat ctg 2592 
His Phe Leu Thr He Ala Arg Gly Thr Phe Ser Lys Ala Leu Asp Leu 
850 855 860 

act gat ttg tgg caa ctt ttt ate ccg tta ctg ata gee ate ccg ctg 2640 
Thr Asp Leu Trp Gin Leu Phe He Pro Leu Leu He Ala He Pro Leu 
865 870 875 880 

gtg atg ggc tta agt ate ctg ctg ctg aaa aaa cag gag gga tga 2685 
Val Met Gly Leu Ser He Leu Leu Leu Lys Lys Gin Glu Gly * 
885 890 



<210> 242 
<211> 1068 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1068) 

<400> 242 

atg gat aag agt aag cgc cat ctg gcg tgg tgg gtt gtc ggg tta ctg 48 

Met Asp Lys Ser Lys Arg His Leu Ala Trp Trp Val Val Gly Leu Leu 
15 10 15 
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gcg gtg gcg get ate gtg gcg tgg tgg ctg ttg cgc ccg gca ggt gtg 96 
Ala Val Ala Ala lie Val Ala Trp Trp Leu Leu Arg Pro Ala Gly Val 
20 25 30 

ccg gaa ggc ttt get gtc agt aat ggg cgc att gaa gcg acg gaa gtg 144 
Pro Glu Gly Phe Ala Val Ser Asn Gly Arg lie Glu Ala Thr Glu Val 
35 40 45 

gat att gec age aaa att gec ggg cgt ate gac acc att ctg gtg aaa 192 
Asp He Ala Ser Lys He Ala Gly Arg He Asp Thr He Leu Val Lys 
50 55 60 

gaa ggc aag ttt gtt cgc gaa ggt gaa gtg ctg gcg aag atg gat act 240 
Glu Gly Lys Phe Val Arg Glu Gly Glu Val Leu Ala Lys Met Asp Thr 
65 70 75 80 

cgc gtg ttg cag gaa cag cga ctg gaa gec ate gcg caa ate aaa gag 288 
Arg Val Leu Gin Glu Gin Arg Leu Glu Ala He Ala Gin He Lys Glu 
85 90 95 

gca caa age gec gtt get gee gcg cag get ttg ctg gag caa cga caa 336 
Ala Gin Ser Ala Val Ala Ala Ala Gin Ala Leu Leu Glu Gin Arg Gin 
100 105 110 

age gaa act cgt gee gca cag teg ctg gtt aat caa cgc cag gca gaa 384 
Ser Glu Thr Arg Ala Ala Gin Ser Leu Val Asn Gin Arg Gin Ala Glu 
115 120 125 

ctg gac tec gta gca aaa cgt cat acg cgt tec cgt tea ctg gee caa 432 
Leu Asp Ser Val Ala Lys Arg His Thr Arg Ser Arg Ser Leu Ala Gin 
130 135 140 

cga ggg get att tct gcg caa cag ctg gat gac gat cgc gee gee get 480 
Arg Gly Ala He Ser Ala Gin Gin Leu Asp Asp Asp Arg Ala Ala Ala 
145 " 150 155 160 

gag age gec cga get gcg ctg gaa teg gcg aaa get cag gta teg get 528 
Glu Ser Ala Arg Ala Ala Leu Glu Ser Ala Lys Ala Gin Val Ser Ala 
165 170 175 

tct aaa gcg get ata gaa gcg gca cgc acc aat ate att cag gcg caa 576 
Ser Lys Ala Ala He Glu Ala Ala Arg Thr Asn He He Gin Ala Gin 
180 185 190 

acc cgc gtc gaa gcg gca caa gee act gaa egg cgc att gee gca gat 624 
Thr Arg Val Glu Ala Ala Gin Ala Thr Glu Arg Arg He Ala Ala Asp 
195 200 205 

ate gat gac age gaa ctg aaa gec ccg cgt gac gga cgc gtg cag tat 672 
He Asp Asp Ser Glu Leu Lys Ala Pro Arg Asp Gly Arg Val Gin Tyr 
210 215 220 

egg gtt gec gag cca ggc gaa gtg ctg gcg gca ggc ggt egg gtg ctg 720 
Arg Val Ala Glu Pro Gly Glu Val Leu Ala Ala Gly Gly Arg Val Leu 
225 230 235 240 

aat atg gtc gat etc age gac gtc tat atg act ttc ttc ctg cca acc 768 
Asn Met Val Asp Leu Ser Asp Val Tyr Met Thr Phe Phe Leu Pro Thr 
245 250 255 
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gaa cag gcg ggc acg ctg aaa ctg ggc ggt gaa gcc egg ctg ate etc 816 
Glu Gin Ala Gly Thr Leu I*ys Leu Gly Gly Glu Ala Arg Leu lie Leu 
260 265 270 

gat gcc gcg cca gat ctg cgt att cct gca ace ate agt ttt gtc gcc 864 
Asp Ala Ala Pro Asp Leu Arg lie Pro Ala Thr lie Ser Phe Val Ala 
275 280 285 

agt gtc gcc cag ttc acg cca aaa acc gtc gaa ace age gat gaa egg 912 
Ser Val Ala Gin Phe Thr Pro Lys Thr Val Glu Thr Ser Asp Glu Arg 
290 295 300 

ctg aaa ctg atg ttc cgc gtc aaa gcg cgt ate cca ccg gaa tta etc 960 
Leu Lys Leu Met Phe Arg Val Lys Ala Arg lie Pro Pro Glu Leu Leu 
305 310 315 320 

cag cag cat ctg gaa tat gtc aaa acc ggt ttg ccg ggc gta gcg tgg 1008 
Gin Gin His Leu Glu Tyr Val Lys Thr Gly Leu Pro Gly Val Ala Trp 
325 330 335 

gtg egg gtg aat gaa gaa ctt ccg tgg cct gac gac etc gtg gtg agg 1056 
Val Arg Val Asn Glu Glu Leu Pro Trp Pro Asp Asp Leu Val Val Arg 
340 345 350 

ttg ccg caa tga 1068 
Leu Pro Gin * 
355 



<210> 243 
<211> 708 
<212> UNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . (708) 

<400> 243 

atg tac cgt tat ttg tct att get gcg gtg gta ctg age gca gca ttt 48 
Met Tyr Arg Tyr Leu Ser He Ala Ala Val Val Leu Ser Ala Ala Phe 
15 10 15 

tec ggc ccg gcg ttg gcc gaa ggt ate aat agt ttt tct cag gcg aaa 96 
Ser Gly Pro Ala Leu Ala Glu Gly He Asn Ser Phe Ser Gin Ala Lys 
20 25 30 

gcc gcg gcg gta aaa gtc cac get gac gcg ccc ggt acg ttt tat tgc 144 
Ala Ala Ala Val Lys Val His Ala Asp Ala Pro Gly Thr Phe Tyr Cys 
35 40 45 

gga tgt aaa att aac tgg cag ggc aaa aaa ggc gtt gtt gat ctg caa 192 
Gly Cys Lys He Asn Trp Gin Gly Lys Lys Gly Val Val Asp Leu Gin 
50 55 60 

teg tgc ggc tat cag gtg cgc aaa aat gaa aac cgc gcc age cgc gta 240 
Ser Cys Gly Tyr Gin Val Arg Lys Asn Glu Asn Arg Ala Ser Arg Val 
65 70 75 80 

gag tgg gaa cat gtc gtt ccc gcc tgg cag ttc ggt cac cag cgc cag 28B 
Glu Trp Glu His Val Val Pro Ala Trp Gin Phe Gly His Gin Arg Gin 
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85 90 95 

tgc tgg cag gac ggt gga cgt aaa aac tgc get aaa gat ccg gtc tat 336 
Cys Trp Gin Asp Gly Gly Arg Lys Asn Cys Ala Lys Asp Pro Val Tyr 
100 105 110 

cgc aag atg gaa age gat atg cat aac ctg cag ccg tea gtc ggt gag 384 
Arg Lys Met Glu Ser Asp Met His Asn Leu Gin Pro Ser Val Gly Glu 
115 120 125 

gtg aat ggc gat cgc ggc aac ttt atg tac age cag tgg aat ggc ggt 432 
Val Asn Gly Asp Arg Gly Asn Phe Met Tyr Ser Gin Trp Asn Gly Gly 
130 135 140 

gaa ggc cag tac ggt caa tgc gee atg aag gtc gat ttc aaa gaa aaa 480 
Glu Gly Gin Tyr Gly Gin Cys Ala Met Lys Val Asp Phe Lys Glu Lys 
145 ^ 150 155 160 

get gee gaa cca cca gcg cgt gca cgc ggt gee att gcg cgc acc tac 528 
Ala Ala Glu Pro Pro Ala Arg Ala Arg Gly Ala lie Ala Arg Thr Tyr 
165 170 175 

ttc tat atg cgc gac caa tac aac ctg aca etc tct cgc cag caa acg 576 
Phe Tyr Met Arg Asp Gin Tyr Asn Leu Thr Leu Ser Arg Gin Gin Thr 
180 185 190 

cag ctg ttc aac gca tgg aac aag atg tat ccg gtt acc gac tgg gag 624 
Gin Leu Phe Asn Ala Trp Asn Lys Met Tyr Pro Val Thr Asp Trp Glu 
195 200 205 

tgc gag cgc gat gaa cgc ate gcg aag gtg cag ggc aat cat aac ccg 672 
Cys Glu Arg Asp Glu Arg lie Ala Lys Val Gin Gly Asn His Asn Pro 
210 215 220 

tat gtg caa cgc get tgc cag gcg cga aag age taa 708 
Tyr Val Gin Arg Ala Cys Gin Ala Arg Lys Ser * 
225 230 235 



<210> 244 
<211> 1443 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1443) 

<400> 244 

atg tec aga agg ctt cgc aga aca aaa ate gtt acc acg tta ggc cca 48 
Met Ser Arg Arg Leu Arg Arg Thr Lys lie Val Thr Thr Leu Gly Pro 
1 .5 10 15 

gca aca gat cgc gat aat aat ctt gaa aaa gtt ate gcg gcg ggt gee 96 
Ala Thr Asp Arg Asp Asn Asn Leu Glu Lys Val He Ala Ala Gly Ala 
20 25 30 

aac gtt gta cgt atg aac ttt tct cac ggc teg cct gaa gat cac aaa 144 
Asn Val Val Arg Met Asn Phe Ser His Gly Ser Pro Glu Asp His Lys 
35 40 45 
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atg cgc gcg gat aaa gtt cgt gag att gcc gca aaa ctg ggg cgt cat 192 
Met Arg Ala Asp Lys Val Arg Glu lie Ala Ala Lys Leu Gly Arg His 
50 55 60 

gtg get att ctg ggt gac etc cag ggg ccc aaa ate cgt gta tec ace 240 
Val Ala lie Leu Gly Asp Leu Gin Gly Pro Lys lie Arg Val Ser Thr 
65 70 75 80 

ttt aaa gaa ggc aaa gtt ttc etc aat att ggg gat aaa ttc ctg etc 288 
Phe Lys Glu Gly Lys Val Phe Leu Asn lie Gly Asp Lys Phe Leu Leu 
85 90 95 

gac gcc aac ctg ggt aaa ggt gaa ggc gac aaa gaa aaa gtc ggt ate 336 
Asp Ala Asn Leu Gly Lys Gly Glu Gly Asp Lys Glu Lys Val Gly lie 
100 105 110 

gac tac aaa ggc ctg cct get gac gtc gtg cct ggt gac ate ctg ctg 384 
Asp Tyr Lys Gly Leu Pro Ala Asp Val Val Pro Gly Asp lie Leu Leu 
115 120 125 

ctg gac gat ggt cgc gtc cag tta aaa gta ctg gaa gtt cag ggc atg 432 
Leu Asp Asp Gly Arg Val Gin Leu Lys Val Leu Glu Val Gin Gly Met 
130 135 140 

aaa gtg ttc ace gaa gtc ace gtc ggt ggt ccc etc tec aac aat aaa 480 
Lys Val Phe Thr Glu Val Thr Val Gly Gly Pro Leu Ser Asn Asn Lys 
145 150 155 160 

ggt ate aac aaa ctt ggc ggc ggt ttg teg get gaa gcg ctg acc gaa 528 
Gly lie Asn Lys Leu Gly Gly Gly Leu Ser Ala Glu Ala Leu Thr Glu 
165 170 175 

aaa gac aaa gca gac att aag act gcg gcg ttg att ggc gta gat tac 576 
Lys Asp Lys Ala Asp lie Lys Thr Ala Ala Leu He Gly Val Asp Tyr 
180 185 190 

ctg get gtc tec ttc cca cgc tgt ggc gaa gat ctg aac tat gcc cgt 624 
Leu Ala Val Ser Phe Pro Arg Cys Gly Glu Asp Leu Asn Tyr Ala Arg 
195 200 205 

cgc ctg gca cgc gat gca gga tgt gat gcg a;*a att gtt gcc aag gtt 672 
Arg Leu Ala Arg Asp Ala Gly Cys Asp Ala Lys He Val Ala Lys Val 
210 215 220 

gaa cgt gcg gaa gcc gtt tgc age cag gat gca atg gat gac ate ate 720 
Glu Arg Ala Glu Ala Val Cys Ser Gin Asp Ala Met Asp Asp He He 
225 230 235 240 

etc gcc tct "gac gtg gta atg gtt gca cgt ggc gac etc ggt gtg gaa 768 
Leu Ala Ser Asp Val Val Met Val Ala Arg Gly Asp Leu Gly Val Glu 
245 250 255 

att ggc gac ccg gaa ctg gtc ggc att cag aaa gcg ttg ate cgt cgt 816 
He Gly Asp Pro Glu Leu Val Gly He Gin Lys Ala Leu He Arg Arg 
260 265 270 

gcg cgt cag eta aac cga gcg gta ate acg gcg acc cag atg atg gag 864 
Ala Arg Gin Leu Asn Arg Ala Val He Thr Ala Thr Gin Met Met Glu 
275 280 285 

tea atg att act aac ccg atg ccg acg cgt gca gaa gtc atg gac gta 912 

• *'** ■ ... « 
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Ser Met lie Thr Asn Pro Met Pro Thr Arg Ala Glu Val Met Asp Val 
290 295 300 

gca aac gcc gtt ctg gat ggt act gac get gtg atg ctg tct gca gaa 960 
Ala Asn Ala Val Leu Asp Gly Thr Asp Ala Val Met Leu Ser Ala Glu 
305 310 * 315 320 

act gcc get ggg cag tat ccg tea gaa acc gtt gca gcc atg gcg cgc 1008 
Thr Ala Ala Gly Gin Tyr Pro Ser Glu Thr Val Ala Ala Met Ala Arg 
325 330 335 

gtt tgc ctg ggt gcg gaa aaa ate ccg age ate aac gtt tct aaa cac 1056 
Val Cys Leu Gly Ala Glu Lys lie Pro Ser lie Asn Val Ser Lys His 
340 345 350 

cgt ctg gac gtt cag ttc gac aat gtg gaa gaa get att gcc atg tea 1104 
Arg Leu Asp Val Gin Phe Asp Asn Val Glu Glu Ala lie Ala Met Ser 
355 360 365 

gca atg tac gca get aac cac ctg aaa ggc gtt acg gcg ate ate acc 1152 
Ala Met Tyr Ala Ala Asn His Leu Lys Gly Val Thr Ala lie lie Thr 
370 375 380 

atg acc gaa teg ggt cgt acc gcg ctg atg acc tec cgt ate age tct 1200 
Met Thr Glu Ser Gly Arg Thr Ala Leu Met Thr Ser Arg lie Ser Ser 
385 390 395 400 

ggt ctg cca att ttc gcc atg teg cgc cat gaa cgt acg ctg aac ctg 1248 
Gly Leu Pro lie Phe Ala Met Ser Arg His Glu Arg Thr Leu Asn Leu 
405 410 415 

act get etc tat cgt ggc gtt acg ccg gtg cac ttt gat age get aat 1296 
Thr Ala Leu Tyr Arg Gly Val Thr Pro Val His Phe Asp Ser Ala Asn 
420 425 430 

gac ggc gta gca get gcc age gaa gcg gtt aat ctg ctg cgc gat aaa 1344 
Asp Gly Val Ala Ala Ala Ser Glu Ala Val Asn Leu Leu Arg Asp Lys 
435 440 445 

ggt tac ttg atg tct ggt gac ctg gtg att gtc acc cag ggc gac gtg 1392 
Gly Tyr Leu Met Ser Gly Asp Leu Val lie Val Thr Gin Gly Asp Val 
450 455 460 

atg agt acc gtg ggt tct act aat acc acg cgt att tta acg gta gag 1440 
Met Ser Thr Val Gly Ser Thr Asn Thr Thr Arg lie Leu Thr Val Glu 
465 470 475 480 

taa 1443 
* 



<210> 245 

<211> 720 

<212> DMA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (720) 
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<400> 245 

atg ate aat gta tta att ate gat gac gac gca atg gtc gcg gag ctg 48 
Met lie Asn Val Leu lie lie Asp Asp Asp Ala Met Val Ala Glu Leu 
15 10 15 

aat cgc cga tac gta gca caa ate cca ggc ttt caa tgc tgt gga aca 96 
Asn Arg Arg Tyr Val Ala Gin He Pro Gly Phe Gin Cys Cys Gly Thr 
20 25 30 

gee teg acg ctg gag aaa gee aaa gag att ate ttc aat age gat acg 144 
Ala Ser Thr Leu Glu Lys Ala Lys Glu He He Phe Asn Ser Asp Thr 
35 40 45 

cct ate gac ctg at a ttg etc gat ate tat atg caa aaa gag aac ggg 192 
Pro He Asp Leu He Leu Leu Asp He Tyr Met Gin Lys Glu Asn Gly 
50 55 60 

etc gat tta ctg cct gtc ctg cat aac gcg cgt tgc aaa agt gat gtg 240 
Leu Asp Leu Leu Pro Val Leu His Asn Ala Arg Cys Lys Ser Asp Val 
65 70 75 80 

att gtc ate tec tec gca gec gat gcg gca ace att aaa gat teg ctg 288 
He Val He Ser Ser Ala Ala Asp Ala Ala Thr He Lys Asp Ser Leu 
85 90 95 

cat tac ggt gtc gtg gat tac ctg ate aaa ccc ttc cag get tec cgc 336 
His Tyr Gly Val Val Asp Tyr Leu He Lys Pro Phe Gin Ala Ser Arg 
100 105 110 

ttt gaa gag gcg etc ace ggc tgg egg caa aag aaa atg gcg ctg gaa 384 
Phe Glu Glu Ala Leu Thr Gly Trp Arg Gin Lys Lys Met Ala Leu Glu 
115 120 125 

aaa cat cag tat tat gac cag gee gag ctt gat cag eta ate cac ggc 432 
Lys His Gin Tyr Tyr Asp Gin Ala Glu Leu Asp Gin Leu He His Gly 
130 135 140 

age age tec aac gaa cag gat cct cgt cgc ttg cca aaa ggc tta acg 480 
Ser Ser Ser Asn Glu Gin Asp Pro Arg Arg Leu Pro Lys Gly Leu Thr 
145 150 ~ 155 ^ " 160 

ccg cag acg tta cgc aca ctg tgc cag tgg att gac gcg cat cag gac 528 
Pro Gin Thr Leu Arg Thr Leu Cys Gin Trp He Asp Ala His Gin Asp 
165 170 175 

tat gaa ttt tea ace gac gag eta gee aac gag gtt aac att teg cgt 576 
Tyr Glu Phe Ser Thr Asp Glu Leu Ala Asn Glu Val Asn He Ser Arg 
180 185 190 

gtt tec tgc cgt aaa tac etc ate tgg ctg gtc aac tgc cac ate ttg 624 
Val Ser Cys Arg Lys Tyr Leu He Trp Leu Val Asn Cys His He Leu 
195 200 205 

ttc acc agt ate cat tat ggc gtc acg ggg cgt cca gtt tac cgt tac 672 
Phe Thr Ser He His Tyr Gly Val Thr Gly Arg Pro Val Tyr Arg Tyr 
210 215 220 

cgt att cag gca gaa cac tac tea eta ctg aaa caa tat tgc caa taa 720 
Arg He Gin Ala Glu His Tyr Ser Leu Leu Lys Gin Tyr Cys Gin * 
225 230 235 
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<210> 246 
<211> 1632 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1632) 

<40Q> 246 

atg aga cat tea ttg ccc tac cgc atg tta cgc aaa cgt ccg atg aaa 48 
Met Arg His Ser Leu Pro Tyr Arg Met Leu Arg Lys Arg Pro Met Lys 
15 10 15 

ttg agt acc aca gtg ate tta atg gtc agt gcg gta ctg ttc teg gtg 96 
Leu Ser Thr Thr Val He Leu Met Val Ser Ala Val Leu Phe Ser Val 
20 25 30 

eta ttg gtg gtg cat ctg att tac ttc teg caa ate agt gat atg acg . 144 
Leu Leu Val Val His Leu He Tyr Phe Ser Gin He Ser Asp Met Thr 
35 40 45 

cga gat ggg eta gee aac aag gca ctg gca gtg gcg cgt acc etc gee 192 
Arg Asp Gly Leu Ala Asn Lys Ala Leu Ala Val Ala Arg Thr Leu Ala 
50 55 60 

gac teg ccg gaa ate cgt cag ggc ttg cag aaa aaa ccg cag gag agt 240 
Asp Ser Pro Glu He Arg Gin Gly Leu Gin Lys Lys Pro Gin Glu Ser 
65 70 75 80 

ggc ate cag gee ate gcg gaa gec gta cgc aaa cgc aac gat ctg ctg 288 
Gly He Gin Ala He Ala Glu Ala Val Arg Lys Arg Asn Asp Leu Leu 
85 90 95 

ttt att gtc gtt acc gat atg caa agt ctt cgc tac teg cat cct gaa 336 
Phe He Val Val Thr Asp Met Gin Ser Leu Arg Tyr Ser His Pro Glu 
100 105 110 

gec cag cgt att ggt cag cca ttt aaa ggt gat gac ate ctt aaa gcg 384 
Ala Gin Arg He Gly Gin Pro Phe Lys Gly Asp Asp He Leu Lys Ala 
115 120 125 

ctg aat ggc gaa gaa aat gtc get ate aat cgc ggt ttt ctg gcg cag 432 
Leu Asn Gly Glu Glu Asn Val Ala He Asn Arg Gly Phe Leu Ala Gin 
130 135 140 

get tta cgc gta ttt acc ccc ate tac gat gaa aat cat aaa caa att 480 
Ala Leu Arg Val Phe Thr Pro He Tyr Asp Glu Asn His Lys Gin He 
145 150 155 160 

ggc gtg gtg gcg ate ggc ctt gag tta age cgt gtg acc caa cag ate 528 
Gly Val Val Ala He Gly Leu Glu Leu Ser Arg Val Thr Gin Gin He 
165 170 175 

aat gac agt cgc tgg age att ate tgg teg gta tta ttt ggc atg ctg 576 
Asn Asp Ser Arg Trp Ser He He Trp Ser Val Leu Phe Gly Met Leu 
180 185 190 

gtc gga ctg att ggc acc tgc att ctg gtt aag gta ctg aaa aaa ate 624 
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Val Gly Leu lie Gly Thr Cys lie Leu Val Lys Val Leu Lys Lys lie 
195 200 205 

ctt ttc ggc ctg gaa ccc tac gaa ate tec acg ctg ttt gag caa cgc 672 
Leu Phe Gly Leu Glu Pro Tyr Glu lie Ser Thr Leu Phe Glu Gin Arg 
210 215 220 

cag gec atg ttg cag tct ate aaa gaa ggc gtc gtt gee gtg gac gat 720 
Gin Ala Met Leu Gin Ser He Lys Glu Gly Val Val Ala Val Asp Asp 
225 230 235 240 

cgc ggc gag gtc acg ctg ate aac gat gee gca caa gaa ttg ctg aat 768 
Arg Gly Glu Val Thr Leu He Asn Asp Ala Ala Gin Glu Leu Leu Asn 
245 250 255 

tac cgt aag teg cag gac gat gag aaa ctg teg acg eta age cac tea 816 
Tyr Arg Lys Ser Gin Asp Asp Glu Lys Leu Ser Thr Leu Ser His Ser 
260 265 270 

tgg tea cag gtg gta gat gtc teg gaa gtg tta cgc gac ggt acc ccg 864 
Trp Ser Gin Val Val Asp Val Ser Glu Val Leu Arg Asp Gly Thr Pro 
275 280 285 

cgc cgc gac gaa gag att acg att aaa gac egg eta tta ctg ate aac 912 
Arg Arg Asp Glu Glu He Thr He Lys Asp Arg Leu Leu Leu He Asn 
290 295 300 

acc gtt ccg gtg cgc agt aat ggc gtt ate ate ggt gee att tea acc 960 
Thr Val Pro Val Arg Ser Asn Gly Val He He Gly Ala He Ser Thr 
305 310 315 320 

ttc agg gac aaa act gaa gta cgt aaa ctg atg cag cga etc gac ggt 1008 
Phe Arg Asp Lys Thr Glu Val Arg Lys Leu Met Gin Arg Leu Asp Gly 
325 330 335 

ctg gtc aac tat get gac gca ctt cgt gaa cga tec cac gaa ttt atg 1056 
Leu Val Asn Tyr Ala Asp Ala Leu Arg Glu Arg Ser His Glu Phe Met 
340 345 350 

aat aaa ttg cat gtg att etc gga tta ttg cat ctg aag agt tat aag 1104 
Asn Lys Leu His Val He Leu Gly Leu Leu His Leu Lys Ser Tyr Lys 
355 360 365 

cag ttg gaa gat tac att etc aaa aca gee aat aac tat cag gaa gag 1152 
Gin Leu Glu Asp Tyr He Leu Lys Thr Ala Asn Asn Tyr Gin Glu Glu 
370 375 380 

att ggc tct ctg ctg ggt aag ate aaa tct ccg gtt ate get ggt ttt 1200 
He Gly Ser Leu Leu Gly Lys He Lys Ser Pro Val He Ala Gly Phe 
385 390 395 400 

tta ate age aag att aac cgc gcg acc gat tta ggc cat acg ctg att 1248 
Leu He Ser Lys lie Asn Arg Ala Thr Asp Leu Gly His Thr Leu He 
405 410 415 

tta aac agt gaa age cag ctg cca gac age ggc agt gag gac cag gtc 1296 
Leu Asn Ser Glu Ser Gin Leu Pro Asp Ser Gly Ser Glu Asp Gin Val 
420 425 430 

gcg acg ctg att acc acg ttg gga aat ctg ata gaa aac gcg ctg gag 1344 
Ala Thr Leu He Thr Thr Leu Gly Asn Leu He Glu Asn Ala Leu Glu 
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435 440 445 

gca tta ggg ccg gaa ccc gga ggc gaa att age gta aca ttg cac tac 1392 
Ala Leu Gly Pro Glu Pro Gly Gly Glu He Ser Val Thr Leu His Tyr 
450 455 460 

cgt cac ggc tgg ctg cac tgt gaa gtt aat gat gat gga ccg ggg ate 1440 
Arg His Gly Trp Leu His Cys Glu Val Asn Asp Asp Gly Pro Gly He 
465 470 475 480 

gca ccc gat aaa ate gat cac att ttt gac aaa ggt gtc teg aca aaa 1488 
Ala Pro Asp Lys He Asp His He Phe Asp Lys Gly Val Ser Thr Lys 
485 490 495 

gga age gag cga ggc gtc ggt tta gca ctt gtc aaa caa cag gta gaa 1536 
Gly Ser Glu Arg Gly Val Gly Leu Ala Leu Val Lys Gin Gin Val Glu 
500 505 510 

aat etc ggc ggc age ate gee gtg gaa teg gaa ccc ggg att ttc aca 1584 
Asn Leu Gly Gly Ser He Ala Val Glu Ser Glu Pro Gly He Phe Thr 
515 520 525 

caa ttt ttt gtc cag ata ccc tgg gac ggg gag agg teg aac aga tga 1632 
Gin Phe Phe Val Gin He Pro Trp Asp Gly Glu Arg Ser Asn Arg * 
530 535 540 



<210> 247 








<211> 987 








<212> DNA 








<213> Escherichia coli 








<220> 








<221> CDS 








<222> (1) . . . (987) 








<400> 247 








ttg agt gta ccg ctg teg 


aca tgg 


aat 


ctt 


Met Ser Val Pro Leu Ser 


Thr Trp 


Asn 


Leu 


1 5 






10 



15 



48 



tat eta cag aag gta act atg ttt cca caa tgc aaa ttt tec cgc gag 96 

Tyr Leu Gin Lys Val Thr Met Phe Pro Gin Cys Lys Phe Ser Arg Glu 
20 25 30 

ttt eta cat .cct cgc tac tgg etc aca tgg ttt ggg ctt ggt gta etc 144 

Phe Leu His Pro Arg Tyr Trp Leu Thr Trp Phe Gly Leu Gly Val Leu 
35 40 45 

tgg ctt tgg gta cag ctt cct tat cct gtt etc tgc ttt etc ggc acg 192 

Trp Leu Trp Val Gin , Leu Pro Tyr Pro Val Leu Cys Phe Leu Gly Thr 
50 55 60 

cgt att ggc gca atg gcg cga cca ttc ctg aaa cgt cgt gaa tct ate 240 

Arg He Gly Ala Met Ala Arg Pro Phe Leu Lys Arg Arg Glu Ser He 
65 70 75 80 

gee cgt aaa aac ctg gaa ctt tgt ttc ccg cag cat tct gcg gaa gaa 288 

Ala Arg Lys Asn Leu Glu Leu Cys Phe Pro Gin His Ser Ala Glu Glu 
85 90 95 
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cgc gag aag atg att gcc gaa aac ttt cgt tea etc ggc atg gcg ctg 336 
Arg Glu Lys Met He Ala Glu Asn Phe Arg Ser Leu Gly Met Ala Leu 
100 105 110 

gta gaa acc ggc atg gca tgg ttc tgg ccc gac agt cgc gta cgt aaa 384 
Val Glu Thr Gly Met Ala Trp Phe Trp Pro Asp Ser Arg Val Arg Lys 
115 120 125 

tgg ttt gat gtt gaa ggg ttg gat aac ctt aaa cgc gca caa atg caa 432 
Trp Phe Asp Val Glu Gly Leu Asp Asn Leu Lys Arg Ala Gin Met Gin 
130 135 140 

aat cgc ggc gta atg gtt gtc ggc gtc cat ttt atg teg ctg gaa ctg 480 
Asn Arg Gly Val Met Val Val Gly Val His Phe Met Ser Leu Glu Leu 
145 150 155 160 

ggc gg c cgc gtg atg gga ctg tgc caa cca atg atg get acc tat cgt 528 
Gly Gly Arg Val Met Gly Leu Cys Gin Pro Met Met Ala Thr Tyr Arg 
165 170 175 

cca cat aat aat cag ctg atg gaa tgg gtg cag acc cgt ggg cgc atg 576 
Pro His Asn Asn Gin Leu Met Glu Trp Val Gin Thr Arg Gly Arg Met 
180 185 190 

cgc tct aac aaa gcg atg ate ggc aga aat aat ctg cgc ggc att gtc 624 
Arg Ser Asn Lys Ala Met He Gly Arg Asn Asn Leu Arg Gly He Val 
195 200 205 

ggt gca ctg aag aaa ggt gaa gcg gta tgg ttt get ccc gat cag gat 672 
Gly Ala Leu Lys Lys Gly Glu Ala Val Trp Phe Ala Pro Asp Gin Asp 
210 215 220 

tat ggt cgt aaa ggc age tec ttc gcg ccg ttc ttt gcg gtg gaa aat 720 
Tyr Gly Arg Lys Gly Ser Ser Phe Ala Pro Phe Phe Ala Val Glu Asn 
225 230 235 240 

gtc gcc aca acc aat ggc acc tat gtt etc tec cgt etc tct ggc gca 768 
Val Ala Thr Thr Asn Gly Thr Tyr Val Leu Ser Arg Leu Ser Gly Ala 
245 250 255 

gcc atg ttg acc gta acg atg gta aga aaa gcg gat tac age gga tat 816 
Ala Met Leu Thr Val Thr Met Val Arg Lys Ala Asp Tyr Ser Gly Tyr 
260 265 270 

cgt ttg ttc ate acc cca gag atg gaa ggc tac ccg aca gat gaa aat 864 
Arg Leu Phe He Thr Pro Glu Met Glu Gly Tyr Pro Thr Asp Glu Asn 
275 280 285 

caa gcc get gcc tat atg aac aag att ate gag aaa gag ate atg cgc 912 
Gin Ala Ala Ala Tyr Met Asn Lys He He Glu Lys Glu He Met Arg 
290 295 300 

gca ccg gag cag tac etc tgg ate cac cgt cgc ttt aaa acg cgc ccg 960 
Ala Pro Glu Gin Tyr Leu Trp He His Arg Arg Phe Lys Thr Arg Pro 
305 310 315 320 

gtg gga gaa teg teg ttg tac att taa 987 
Val Gly Glu Ser Ser Leu Tyr He * 
325 
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<210> 248 
<2ll> 264 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (264) 

<400> 248 

ttg get aat ate aaa tea get aag aag cgc gee att cag tct gaa aag 48 
Met Ala Asn lie Lys Ser Ala Lys Lys Arg Ala lie Gin Ser Glu Lys 
15 10 15 

get cgt aag cac aac gca age cgt cgc tct atg atg cgt act ttc ate 96 
Ala Arg Lys His Asn Ala Ser Arg Arg Ser Met Met Arg Thr Phe lie 
20 25 30 

aag aaa gta tac gca get ate gaa get ggc gac aaa get get gca cag 144 
Lys Lys Val Tyr Ala Ala lie Glu Ala Gly Asp Lys Ala Ala Ala Gin 
35 40 45 

aaa gca ttt aac gaa atg caa ccg ate gtg gac cgt cag get get aaa 192 
Lys Ala Phe Asn Glu Met Gin Pro lie Val Asp Arg Gin Ala Ala Lys 
50 55 60 

ggt ctg ate cac aaa aac aaa get gca cgt cat aag get aac ctg act 240 
Gly Leu lie His Lys Asn Lys Ala Ala Arg His Lys Ala Asn Leu Thr 
65 70 75 80 

gca cag ate aac aaa ctg get taa 264 
Ala Gin He Asn Lys Leu Ala * 
85 



<210> 249 
<211> 1293 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1293) 

<400> 249 

atg cgc tat aat ggt tta aat aat atg ttt ttc cct ctt tgc ctg att 48 
Met Arg Tyr Asn Gly Leu Asn Asn Met Phe Phe Pro Leu Cys Leu He 
15 10 15 

aac gat aac cac tct gtc aca agt cca tea cat aca aag aaa aca aaa 96 
Asn Asp Asn His Ser Val Thr Ser Pro Ser His Thr Lys Lys Thr Lys 
20 25 30 

tea gat aat tac age aaa cat cat aaa aac acg tta att gac aat aaa 144 
Ser Asp Asn Tyr Ser Lys His His Lys Asn Thr Leu He Asp Asn Lys 
35 40 45 

gee etc tct ctt ttc aaa atg gat gat cat gaa aaa gtg ata ggc ttg 192 
Ala Leu Ser Leu Phe Lys Met Asp Asp His Glu Lys Val He Gly Leu 
50 55 60 
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att cag aaa atg aaa aga att tat gat agt tta cca tea gga aaa ate 240 
lie Gin Lys Met Lys Arg lie Tyr Asp Ser Leu Pro Ser Gly Lys He 
65 70 75 80 

acg aaa gaa acg gac agg aaa ata cat aaa tat ttt ata gat ata get 288 
Thr Lys Glu Thr Asp Arg Lys lie His Lys Tyr Phe He Asp He Ala 
85 90 , 95 

tea cat gca aat aat aaa tgt gac gat aga att acg aga aga gtt tac 336 
Ser His Ala Asn Asn Lys Cys Asp Asp Arg He Thr Arg Arg Val Tyr 
100 105 110 

ctt aat aaa gat aag gaa gtg tea att aag gtg gta tat ttt ata aat 384 
Leu Asn Lys Asp Lys Glu Val Ser He Lys Val Val Tyr Phe He Asn 
115 120 125 

aat gtc acc gtc cat aat aat act ate gaa ate cca cag aca gta aat 432 
Asn Val Thr Val His Asn Asn Thr He Glu He Pro Gin Thr Val Asn 
130 135 140 

ggt ggt tac gat ttt tea cac ctt age ctg aaa ggt ate gtg att aaa 480 
Gly Gly Tyr Asp Phe Ser His Leu Ser Leu Lys Gly He Val He Lys 
145 150 155 160 

gat gaa gat tta tec aat teg aat ttt gca ggt tgc aga eta caa aac 528 
Asp Glu Asp Leu Ser Asn Ser Asn Phe Ala Gly Cys Arg Leu Gin Asn 
165 170 175 

get att ttt cag gac tgt aat atg tat aaa acg aat ttt aat ttc gee 576 
Ala He Phe Gin Asp Cys Asn Met Tyr Lys Thr Asn Phe Asn Phe Ala 
180 185 190 

ata atg gaa aaa ata ctt ttt gat aat tgt att etc gat gac tea aat 624 
He Met Glu Lys He Leu Phe Asp Asn Cys He Leu Asp Asp Ser Asn 
195 200 205 

ttc get cag ata aaa atg act gac gga act eta aat tea tgt tec get 672 
Phe Ala Gin He Lys Met Thr Asp Gly Thr Leu Asn Ser Cys Ser Ala 
210 215 220 

atg cat gtt caa ttc tac aat gca aca atg aat aga gee aat att aaa 720 
Met His Val Gin Phe Tyr Asn Ala Thr Met Asn Arg Ala Asn He Lys 
225 230 235 240 

aat acc ttc ctt gat tat tea aat ttt tat atg gca tac atg get gag 768 
Asn Thr Phe Leu Asp Tyr Ser Asn Phe Tyr Met Ala Tyr Met Ala Glu 
245 250 255 

gta aat ctt tat aaa gta ata gcg cca tat att aat tta ttt aga gec 816 
Val Asn Leu Tyr Lys Val He Ala Pro Tyr He Asn Leu Phe Arg Ala 
260 265 270 

gac ctt age ttc tct aaa ctt gat tta att aac ttt gaa cat get gat 864 
Asp Leu Ser Phe Ser Lys Leu Asp Leu He Asn Phe Glu His Ala Asp 
275 280 285 

ctg tct cgt gtc aac ctg aat aaa gca acc etc cag aat ata aac tta 912 
Leu Ser Arg Val Asn Leu Asn Lys Ala Thr Leu Gin Asn He Asn Leu 
290 295 300 
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att gat age aaa etc ttt ttt acg egg tta aca aat acg ttc etc gaa 960 
lie Asp Ser Lys Leu Phe Phe Thr Arg Leu Thr Asn Thr Phe Leu Glu 
305 310 315 320 

atg gtt ata tgt ace gac tct aat atg get aat gtt aat ttt aat aat 1008 
Met Val lie Cys Thr Asp Ser Asn Met Ala Asn Val Asn Phe Asn Asn 
325 330 335 

gee aat tta age aat tgc cat ttc aac tgt tct gtt tta aca aaa gee 1056 
Ala Asn Leu Ser Asn Cys His Phe Asn Cys Ser Val Leu Thr Lys Ala 
340 345 350 

tgg atg ttt aat ate cgt etc tat cgt gtt aat ttc gat gag get age 1104 
Trp Met Phe Asn lie Arg Leu Tyr Arg Val Asn Phe Asp Glu Ala Ser 
355 360 365 

gtc cag gga atg ggt att ace att etc cgt ggt gag gaa aat ate tec 1152 
Val Gin Gly Met Gly lie Thr lie Leu Arg Gly Glu Glu Asn lie Ser 
370 375 380 

att aat agt gat ate ctg gta aca eta cag aaa ttc ttt gaa gaa gat 1200 
lie Asn Ser Asp lie Leu Val Thr Leu Gin Lys Phe Phe Glu Glu Asp 
385 390 395 400 

tgt gec act cat act ggc atg tea caa act gag gat aat ctt cat gca 1248 
Cys Ala Thr His Thr Gly Met Ser Gin Thr Glu Asp Asn Leu His Ala 

405 410 415 

gtc get atg aag att act gca gat att atg caa gat gca gat tga 1293 
Val Ala Met Lys lie Thr Ala Asp He Met Gin Asp Ala Asp * 
420 425 430 



<210> 250 
<211> 2652 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (1) . . . (2652) 
<400> 250 

atg aac gaa caa tat tec gca ttg cgt agt aat gtc agt atg etc ggc 48 
Met Asn Glu Gin Tyr Ser Ala Leu Arg Ser Asn Val Ser Met Leu Gly 
15 10 15 

aaa gtg ctg gga gaa ace ate aag gat gcg ttg gga gaa cac att ctt 96 
Lys Val Leu Gly Glu Thr He Lys Asp Ala Leu Gly Glu His He Leu 
20 25 30 

gaa cgc gta gaa act ate cgt aag ttg teg aaa tct tea cgc get ggc 144 
Glu Arg Val Glu Thr He Arg Lys Leu Ser Lys Ser Ser Arg Ala Gly 
35 40 45 

aat gat get aac cgc cag gag ttg etc ace acc tta caa aat ttg teg 192 
Asn Asp Ala Asn Arg Gin Glu Leu Leu Thr Thr Leu Gin Asn Leu Ser 
50 55 60 

aac gac gag ctg ctg ccc gtt gcg cgt gcg ttt agt cag ttc ctg aac 240 
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Asn Asp Glu Leu Leu Pro Val Ala Arg Ala Phe Ser Gin Phe Leu Asn 
65 70 75 80 

ctg gcc aac acc gcc gag caa tac cac age att teg ccg aaa ggc gaa 288 
Leu Ala Asn Thr Ala Glu Gin Tyr His Ser lie Ser Pro Lys Gly Glu 
85 90 95 

get gcc age aac ccg gaa gtg ate gcc cgc acc ctg cgt aaa ctg aaa 336 
Ala Ala Ser Asn Pro Glu Val lie Ala Arg Thr Leu Arg Lys Leu Lys 
100 105 110 

aac cag ccg gaa ctg age gaa gac acc ate aaa aaa gca gtg gaa teg 384 
Asn Gin Pro Glu Leu Ser Glu Asp Thr lie Lys Lys Ala Val Glu Ser 
115 120 125 

ctg teg ctg gaa ctg gtc etc acg get cac cca acc gaa att acc cgt 432 
Leu Ser Leu Glu Leu Val Leu Thr Ala His Pro Thr Glu lie Thr Arg 
130 135 140 

cgt aca ctg ate cac aaa atg gtg gaa gtg aac gcc tgt tta aaa cag 480 
Arg Thr Leu lie His Lys Met Val Glu Val Asn Ala Cys Leu Lys Gin 
145 ) 150 155 160 

etc gat aac aaa gat ate get gac tac gaa cac aac cag ctg atg cgt 528 
Leu Asp Asn Lys Asp lie Ala Asp Tyr Glu His Asn Gin Leu Met Arg 
165 170 175 

cgc ctg cgc cag ttg ate gcc cag tea tgg cat acc gat gaa ate cgt 576 
Arg Leu Arg Gin Leu lie Ala Gin Ser Trp His Thr Asp Glu lie Arg 
180 185 " 190 

aag ctg cgt cca age ccg gta gat gaa gcc aaa tgg ggc ttt gcc gta 624 
Lys Leu Arg Pro Ser Pro Val Asp Glu Ala Lys Trp Gly Phe Ala Val 
195 200 205 

gtg gaa aac age ctg tgg caa ggc gta cca aat tac ctg cgc gaa ctg 672 
Val Glu Asn Ser Leu Trp Gin Gly Val Pro Asn Tyr Leu Arg Glu Leu 
210 215 220 

aac gaa caa ctg gaa gag aac etc ggc tac aaa ctg ccc gtc gaa ttt 720 
Asn Glu Gin Leu Glu Glu Asn Leu Gly Tyr Lys Leu Pro Val Glu Phe 
225 230 235 240 

gtt ccg gtc cgt ttt act teg tgg atg ggc ggc gac cgc gac ggc aac 768 
Val Pro Val Arg Phe Thr Ser Trp Met Gly Gly Asp Arg Asp Gly Asn 
245 250 255 

ccg aac gtc act gcc gat ate acc cgc cac gtc ctg eta etc age cgc 816 
Pro Asn Val Thr Ala Asp lie Thr Arg His Val Leu Leu Leu Ser Arg 
260 265 270 

tgg aaa gcc acc gat ttg ttc ctg aaa gat att cag gtg ctg gtt* tct 864 
Trp Lys Ala Thr Asp Leu Phe Leu Lys Asp lie Gin Val Leu Val Ser 
275 280 285 

gaa ctg teg atg gtt gaa gcg acc cct gaa ctg ctg gcg ctg gtt ggc 912 
Glu Leu Ser Met Val Glu Ala Thr Pro Glu Leu Leu Ala Leu Val Gly 
290 295 300 

gaa gaa ggt gcc gca gaa ccg tat cgc tat ctg atg aaa aac ctg cgt 960 
Glu Glu Gly Ala Ala Glu Pro Tyr Arg Tyr Leu Met Lys Asn Leu Arg 
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305 310 315 320 

tct cgc ctg atg gcg aca cag gca tgg ctg gaa gcg cgc ctg aaa ggc 1008 
Ser Arg Leu Met Ala Thr Gin Ala Trp Leu Glu Ala Arg Leu Lys Gly 
325 330 335 

gaa gaa ctg cca aaa cca gaa ggc ctg ctg aca caa aac gaa gaa ctg 1056 
Glu Glu Leu Pro Lys Pro Glu Gly Leu Leu Thr Gin Asn Glu Glu Leu 
340 345 350 

tgg gaa ccg etc tac get tgc tac cag tea ctt cag gcg tgt ggc atg 1104 
Trp Glu Pro Leu Tyr Ala Cys Tyr Gin Ser Leu Gin Ala Cys Gly Met 
355 360 365 

ggt att ate gee aac ggc gat ctg etc gac acc ctg cgc cgc gtg aaa 1152 
Gly lie lie Ala Asn Gly Asp Leu Leu Asp Thr Leu Arg Arg Val Lys 
370 375 380 

tgt ttc ggc gta ccg ctg gtc cgt att gat ate cgt cag gag age acg 1200 
Cys Phe Gly Val Pro Leu Val Arg lie Asp lie Arg Gin Glu Ser Thr 
385 390 395 400 

cgt cat acc gaa gcg ctg ggc gag ctg acc cgc tac etc ggt ate ggc 1248 
Arg His Thr Glu Ala Leu Gly Glu Leu Thr Arg Tyr Leu Gly He Gly 
405 410 415 

gac tac gaa age tgg tea gag gee gac aaa cag gcg ttc ctg ate cgc 1296 
Asp Tyr Glu Ser Trp Ser Glu Ala Asp Lys Gin Ala Phe Leu lie Arg 
420 425 430 

gaa ctg aac tec aaa cgt ccg ctt ctg ccg cgc aac tgg caa cca age 1344 
Glu Leu Asn Ser Lys Arg Pro Leu Leu Pro Arg Asn Trp Gin Pro Ser 
435 440 445 

gee gaa acg cgc gaa gtg etc gat acc tgc cag gtg att gee gaa gca 1392 
Ala Glu Thr Arg Glu Val Leu Asp Thr Cys Gin Val He Ala Glu Ala 
450 ^ 455 460 

ccg caa ggc tec att gee gee tac gtg ate teg atg gcg aaa acg ccg 1440 
Pro Gin Gly Ser lie Ala Ala Tyr Val lie Ser Met Ala Lys Thr Pro 
465 470 475 480 

tec gac gta ctg get gtc cac ctg ctg ctg aaa gaa gcg ggt ate ggg 1488 
Ser Asp Val Leu Ala Val His Leu Leu Leu Lys Glu Ala Gly He Gly 
485 490 495 

ttt gcg atg ccg gtt get ccg ctg ttt gaa acc etc gat gat ctg aac 1536 
Phe Ala Met Pro Val Ala Pro Leu Phe Glu Thr Leu Asp Asp Leu Asn 
500 505 510 

aac gee aac gat gtc atg acc cag ctg etc aat att gac tgg tat cgt 1584 
Asn Ala Asn Asp Val Met Thr Gin Leu Leu Asn He Asp Trp Tyr Arg 
515 520 525 

ggc ctg att cag ggc aaa cag atg gtg atg att ggc tat tec gac tea 1632 
Gly Leu He Gin Gly Lys Gin Met Val Met He Gly Tyr Ser Asp Ser 
530 535 540 

gca aaa gat gcg gga gtg atg gca get tec tgg gcg caa tat cag gca 1680 
Ala Lys Asp Ala Gly Val Met Ala Ala Ser Trp Ala Gin Tyr Gin Ala 
545 550 555 560 
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cag gat gca tta ate aaa acc tgc gaa aaa gcg ggt att gag ctg acg 1728 
Gin Asp Ala Leu lie Lys Thr Cys Glu Lys Ala Gly lie Glu Leu Thr 
565 570 . 575 

ttg ttc cac ggt cgc ggc ggt tec att ggt cgc ggc ggc gca cct get 1776 
Leu Phe His Gly Arg Gly Gly Ser He Gly Arg Gly Gly Ala Pro Ala 
580 585 590 

cat gcg gcg ctg ctg tea caa ccg cca gga age ctg aaa ggc ggc ctg 1824 
His Ala Ala Leu Leu Ser Gin Pro Pro Gly Ser Leu Lys Gly Gly Leu 
595 600 605 

cgc gta acc gaa cag ggc gag atg ate cgc ttt aaa tat ggt ctg cca 1872 
Arg Val Thr Glu Gin Gly Glu Met He Arg Phe Lys Tyr Gly Leu Pro 
610 615 620 

gaa ate acc gtc age age ctg teg ctt tat acc ggg gcg att ctg gaa 1920 
Glu He Thr Val Ser Ser Leu Ser Leu Tyr Thr Gly Ala He Leu Glu 
625 630 635 640 

gec aac ctg ctg cca ccg ccg gag ccg aaa gag age tgg cgt cgc att 1968 
Ala Asn Leu Leu Pro Pro Pro Glu Pro Lys Glu Ser Trp Arg Arg He 
645 650 655 

atg gat gaa ctg tea gtc ate tec tgc gat gtc tac cgc ggc tac gta 2016 
Met Asp Glu Leu Ser Val He Ser Cys Asp Val Tyr Arg Gly Tyr Val 
660 665 670 

cgt gaa aac aaa gat ttt gtg cct tac ttc cgc tec get acg ccg gaa 2064 
Arg Glu Asn Lys Asp Phe Val Pro Tyr Phe Arg Ser Ala Thr Pro Glu 
675 680 685 

caa gaa ctg ggc aaa ctg ccg ttg ggt tea cgt ccg gcg aaa cgt cgc 2112 
Gin Glu Leu Gly Lys Leu Pro Leu Gly Ser Arg Pro Ala Lys Arg Arg 
690 695 700 

cca acc ggc ggc gtc gag tea eta cgc gee att ccg tgg ate ttc gec 2160 
Pro Thr Gly Gly Val Glu Ser Leu Arg Ala He Pro Trp He Phe Ala 
705 710 715 720 

tgg acg caa aac cgt ctg atg etc ccc gec tgg ctg ggt gca ggt acg 2208 
Trp Thr Gin Asn Arg Leu Met Leu Pro Ala Trp Leu Gly Ala Gly Thr 
725 730 735 

gcg ctg caa aaa gtg gtc gaa gac ggc aaa cag age gag ctg gag get 2256 
Ala Leu Gin Lys Val Val Glu Asp Gly Lys Gin Ser Glu Leu Glu Ala 
740 745 750 

atg tgc cgc gat tgg cca ttc ttc teg acg cgt etc ggc atg ctg gag 2304 
Met Cys Arg Asp Trp Pro Phe Phe Ser Thr Arg Leu Gly Met Leu Glu 
755 760 765 

atg gtc ttc gec aaa gca gac ctg tgg ctg gcg gaa tac tat gac caa 2352 
Met Val Phe Ala Lys Ala Asp Leu Trp Leu Ala Glu Tyr Tyr Asp Gin 
770 775 780 

cgc ctg gta gac aaa gca ctg tgg ccg tta ggt aaa gag tta cgc aac 2400 
Arg Leu Val Asp Lys Ala Leu Trp Pro Leu Gly Lys Glu Leu Arg Asn 
785 790 795 800 
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ctg caa gaa gaa gac ate aaa gfcg gtg ctg gcg att gec aac gat tec 2448 
l*eu Gin Glu Glu Asp lie Lys Val Val Leu Ala lie Ala Asn Asp Ser 
805 ~ 810 815 

cat ctg atg gec gat ctg ccg tgg att gca gag tct att cag eta egg 2496 
His Leu Met Ala Asp Leu Pro Trp He Ala Glu Ser lie Gin Leu Arg 
820 825 830 

aat att tac acc gac ccg ctg aac gta ttg cag gec gag ttg ctg cac 2544 
Asn He Tyr Thr Asp Pro Leu Asn Val Leu Gin Ala Glu Leu Leu His 
835 840 845 

cgc tec cgc cag gca gaa aaa gaa ggc cag gaa ccg gat cct cgc gtc 2592 
Arg Ser Arg Gin Ala Glu Lys Glu Gly Gin Glu Pro Asp Pro Arg Val 
850 855 860 

gaa caa gcg tta atg gtc act att gec ggg att gcg gca ggt atg cgt 2640 
Glu Gin Ala Leu Met Val Thr He Ala Gly He Ala Ala Gly Met Arg 
865 870 875 880 

aat acc ggc taa 2652 
Asn Thr Gly * 



<210> 251 
<211> 1992 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1992) 

<400> 251 

atg tec tea cgt aaa gag ctt gec aat get att cgt gcg ctg age atg 48 

Met Ser Ser Arg Lys Glu Leu Ala Asn Ala He Arg Ala Leu Ser Met 
1 5 10 15 

gac gca gta cag aaa gee aaa tec ggt cac ccg ggt gec cct atg ggt 96 
Asp Ala Val Gin Lys Ala Lys Ser Gly His Pro Gly Ala Pro Met Gly 
20 25 30 

atg get gac att gec gaa gtc ctg tgg cgt gat ttc ctg aaa cac aac 144 
Met Ala Asp He Ala Glu Val Leu Trp Arg Asp Phe Leu Lys His Asn 
35 40 45 

ccg cag aat ccg tec tgg get gac cgt gac cgc ttc gtg ctg tec aac 192 
Pro Gin Asn Pro Ser Trp Ala Asp Arg Asp Arg Phe Val Leu Ser Asn 
50 55 60 

ggc cac ggc tec atg ctg ate tac age ctg ctg cac etc acc ggt tac 240 
Gly His Gly Ser Met Leu He Tyr Ser Leu Leu His Leu Thr Gly Tyr 
65 70 75 80 

gat ctg ccg atg gaa gaa ctg aaa aac ttc cgt cag ctg cac tct aaa 288 
Asp Leu Pro Met Glu Glu Leu Lys Asn Phe Arg Gin Leu His Ser Lys 
85 90 95 

act ccg ggt cac ccg gaa gtg ggt tac acc get ggt gtg gaa acc acc 336 
Thr Pro Gly His Pro Glu Val Gly Tyr Thr Ala Gly Val Glu Thr Thr 
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100 105 110 

acc ggt ccg ctg ggt cag ggt att gcc aac gca gtc ggt atg gcg att 384 
Thr Gly Pro Leu Gly Gin Gly He Ala Asn Ala Val Gly Met Ala He 
115 120 125 

gca gaa aaa acg ctg gcg gcg cag ttt aac cgt ccg ggc cac gac att 432 
Ala Glu Lys Thr Leu Ala Ala Gin Phe Asn Arg Pro Gly His Asp He 
130 135 140 

gtc gac cac tac acc tac gcc ttc atg ggc gac ggc tgc atg atg gaa 480 
Val Asp His Tyr Thr Tyr Ala Phe Met Gly Asp Gly Cys Met Met Glu 
145 150 155 160 

ggc ate tec cac gaa gtt tgc tct ctg gcg ggt acg ctg aag ctg ggt 528 
Gly He Ser His Glu Val Cys Ser Leu Ala Gly Thr Leu Lys Leu Gly 
165 170 175 

aaa ctg att gca ttc tac gat gac aac ggt att tct ate gat ggt cac 576 
Lys Leu He Ala Phe Tyr Asp Asp Asn Gly He Ser He Asp Gly His 
180 185 190 

gtt gaa ggc tgg ttc acc gac gac acc gca atg cgt ttc gaa get tac 624 
Val Glu Gly Trp Phe Thr Asp Asp Thr Ala Met Arg Phe Glu Ala Tyr 
195 200 205 

ggc tgg cac gtt att cgc gac ate gac ggt cat gac gcg gca tct ate 672 
Gly Trp His Val He Arg Asp He Asp Gly His Asp Ala Ala Ser He 
210 215 220 

aaa cgc gca gta gaa gaa gcg cgc gca gtg act gac aaa cct tec ctg 720 
Lys Arg Ala Val Glu Glu Ala Arg Ala Val Thr Asp Lys Pro Ser Leu 
225 230 235 240 

ctg atg tgc aaa acc ate ate ggt ttc ggt tec ccg aac aaa gcc ggt 768 
Leu Met Cys Lys Thr He He Gly Phe Gly Ser Pro Asn Lys Ala Gly 
245 250 255 

acc cac gac tec cac ggt gcg ccg ctg ggc gac get gaa att gcc ctg 816 
Thr His Asp Ser His Gly Ala Pro Leu Gly Asp Ala Glu He Ala Leu 
260 265 270 

acc cgc gaa caa ctg ggc tgg aaa tat gcg ccg ttc gaa ate ccg tct 864 
Thr Arg Glu Gin Leu Gly Trp Lys Tyr Ala Pro Phe Glu He Pro Ser 
275 280 285 

gaa ate tat get cag tgg gat gcg aaa gaa gca ggc cag gcg aaa gaa 912 
Glu He Tyr Ala Gin Trp Asp Ala Lys Glu Ala Gly Gin Ala Lys Glu 
290 295 300 

tec gca tgg aac gag aaa ttc get get tac gcg aaa get tat ccg cag 960 
Ser Ala Trp Asn Glu Lys Phe Ala Ala Tyr Ala Lys Ala Tyr Pro Gin 
305 310 315 320 

gaa gcc get gaa ttt acc cgc cgt atg aaa ggc gaa atg ccg tct gac 1008 
Glu Ala Ala Glu Phe Thr Arg Arg Met Lys Gly Glu Met Pro Ser Asp 
325 330 335 

ttc gac get aaa gcg aaa gag ttc ate get aaa ctg cag get aat ccg 1056 
Phe Asp Ala Lys Ala Lys Glu Phe He Ala Lys Leu Gin Ala Asn Pro 
340 345 350 
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gcg aaa ate gec age cgt aaa gcg tct cag aat get ate gaa gcg ttc 1104 
Ala Lys lie Ala Ser Arg Lys Ala Ser Gin Asn Ala He Glu Ala Phe 
355 360 365 

ggt ccg ctg ttg ccg gaa ttc etc ggc ggt tct get gac ctg gcg ccg 1152 
Gly Pro Leu Leu Pro Glu Phe Leu Gly Gly Ser Ala Asp Leu Ala Pro 
370 375 380 

tct aac ctg acc ctg tgg tct ggt tct aaa gca ate aac gaa gat get 1200 
Ser Asn Leu Thr Leu Trp Ser Gly Ser Lys Ala He Asn Glu Asp Ala 
385 390 395 400 

gcg ggt aac tac ate cac tac ggt gtt cgc gag ttc ggt atg acc gcg 1248 
Ala Gly Asn Tyr He His Tyr Gly Val Arg Glu Phe Gly Met Thr Ala 
405 410 415 

att get aac ggt ate tec ctg cac ggt ggc ttc ctg ccg tac acc tec 1296 
He Ala Asn Gly He Ser Leu His Gly Gly Phe Leu Pro Tyr Thr Ser 
420 425 430 

acc ttc ctg atg ttc gtg gaa tac gca cgt aac gee gta cgt atg get 1344 
Thr Phe Leu Met Phe Val Glu Tyr Ala Arg Asn Ala Val Arg Met Ala 
435 440 445 

gcg ctg atg aaa cag cgt cag gtg atg gtt tac acc cac gac tec ate 1392 
Ala Leu Met Lys Gin Arg Gin Val Met Val Tyr Thr His Asp Ser He 
450 455 460 

ggt ctg ggc gaa gac ggg ccg act cac cag ccg gtt gag cag gtc get 1440 
Gly Leu Gly Glu Asp Gly Pro Thr His Gin Pro Val Glu Gin Val Ala 
465 470 475 480 

tct ctg cgc gta acc ccg aac atg tct aca tgg cgt ccg tgt gac cag 1488 
Ser Leu Arg Val Thr Pro Asn Met Ser Thr Trp Arg Pro Cys Asp Gin 
485 490 495 

gtt gaa tec gcg gtc gcg tgg aaa tac ggt gtt gag cgt cag gac ggc 1536 
Val Glu Ser Ala Val Ala Trp Lys Tyr Gly Val Glu Arg Gin Asp Gly 
500 505 510 

ccg acc gca ctg ate etc tec cgt cag aac ctg gcg cag cag gaa cga 1584 
Pro Thr Ala Leu He Leu Ser Arg Gin Asn Leu Ala Gin Gin Glu Arg 
515 520 525 

act gaa gag caa ctg gca aac ate gcg cgc ggt ggt tat gtg ctg aaa 1632 
Thr Glu Glu Gin Leu Ala Asn He Ala Arg Gly Gly Tyr Val Leu Lys 
530 535 540 

gac tgc gee ggt cag ccg gaa ctg att ttc ate get acc ggt tea gaa 1680 
Asp Cys Ala Gly Gin Pro Glu Leu He Phe He Ala Thr Gly Ser Glu 
545 550 555 560 

gtt gaa ctg get gtt get gee tac gaa aaa ctg act gee gaa ggc gtg 1728 
Val Glu Leu Ala Val Ala Ala Tyr Glu Lys Leu Thr Ala Glu Gly Val 
565 570 575 

aaa gcg cgc gtg gtg tec atg teg tct acc gac gca ttt gac aag cag 1776 
Lys Ala Arg Val Val Ser Met Ser Ser Thr Asp Ala Phe Asp Lys Gin 
580 585 590 
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gat get get tac cgt gaa tec gta ctg ccg aaa gcg gtt act gca cgc 1824 
Asp Ala Ala Tyr Arg Glu Ser Val Leu Pro Lys Ala Val Thr Ala Arg 
595 600 605 

gtt get gta gaa gcg ggt att get gac tac tgg tac aag tat gtt ggc 1872 
Val Ala Val Glu Ala Gly He Ala Asp Tyr Trp Tyr Lys Tyr Val Gly 
610 615 620 

ctg aac ggt get ate gtc ggt atg acc ace ttc ggt gaa tct get ccg 1920 
Leu Asn Gly Ala He Val Gly Met Thr Thr Phe Gly Glu Ser Ala Pro 
625 630 635 640 

gca gag ctg ctg ttt gaa gag ttc ggc ttc act gtt gat aac gtt gtt 1968 
Ala Glu Leu Leu Phe Glu Glu Phe Gly Phe Thr Val Asp Asn Val Val 
645 650 655 

gcg aaa gca aaa gaa ctg ctg taa 1992 
Ala Lys Ala Lys Glu Leu Leu * 
660 



<210> 252 
<211> 459 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (459) 

<400> 252 

atg ttc egg gga gca acg tta gtc aat etc gac age aaa ggg cgc tta 48 
Met Phe Arg Gly Ala Thr Leu Val Asn Leu Asp Ser Lys Gly Arg Leu 
15 10 15 

tea gtg cct acc cgt tat egg gaa cag ctg ctt gag aac get gee ggt 96 
Ser Val Pro Thr Arg Tyr Arg Glu Gin Leu Leu Glu Asn Ala Ala Gly 
20 25 30 

caa atg gtt tgc acc att gac att tat cac ccg tgc ctg ctg ctt tac 144 
Gin Met Val Cys Thr He Asp He Tyr His Pro Cys Leu Leu Leu Tyr 
35 40 45 

ccc ctg cct gaa tgg gaa att ate gag caa aaa tta teg cgt ctg teg 192 
Pro Leu Pro Glu Trp Glu He He Glu Gin Lys Leu Ser Arg Leu Ser 
50 55 60 

age atg aac ccg gtt gag cgc cgt gtg cag cgc eta ctg tta ggt cat 240 
Ser Met Asn Pro Val Glu Arg Arg Val Gin Arg Leu Leu Leu Gly His 
65 70 75 80 

gee age gaa tgt cag atg gat ggc gca ggt cga ttg tta ate gcg cca 288 
Ala Ser Glu Cys Gin Met Asp Gly Ala Gly Arg Leu Leu He Ala Pro 
85 90 95 

gta ctg egg caa cat gee ggg ctg aca aaa gaa gtg atg ctg gtt gga 336 
Val Leu Arg Gin His Ala Gly Leu Thr Lys Glu Val Met Leu Val Gly 
100 105 110 

cag ttc aac aag ttt gag ctg tgg gat gaa aca acc tgg cat caa cag 384 
Gin Phe Asn Lys Phe Glu Leu Trp Asp Glu Thr Thr Trp His Gin Gin 
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115 120 125 

gtc aag gaa gat ate gac gca gag cag ttg get acc gga gac tta teg 432 
Val Lys Glu Asp lie Asp Ala Glu Gin Leu Ala Thr Gly Asp Leu Ser 
130 135 140 

gag cga ctg cag gac ttg tct eta taa 459 
Glu Arg Leu Gin Asp Leu Ser Leu * 
145 ~ 150 



<210> 253 
<211> 942 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (942) 

<400> 253 

atg atg gaa aac tat aaa cat act acg gtg ctg ctg gat gaa gec gtt 48 
Met Met Glu Asn Tyr Lys His Thr Thr Val Leu Leu Asp Glu Ala Val 
1 5 10 15 

aat ggc etc aat ate cgt cct gat ggc ate tac att gat ggg act ttt 96 
Asn Gly Leu Asn lie Arg Pro Asp Gly lie Tyr lie Asp Gly Thr Phe 
20 25 30 

ggt cgc ggt ggt cac tea cgt ctg ate etc teg cag ctt ggc gaa gag 144 
Gly Arg Gly Gly His Ser Arg Leu rie Leu Ser Gin Leu Gly Glu Glu 
35 40 45 

555 cgt ttg ctg gcg ate gat cgc gac ccg cag get ate gee gtt gcg 192 
Gly Arg Leu Leu Ala lie Asp Arg Asp Pro Gin Ala He Ala Val Ala 
50 55 60 

aag act att gat gat ccg cgc ttc tec ate ate cac gga cct ttc tec 240 
Lys Thr He Asp Asp Pro Arg Phe Ser He He His Gly Pro Phe Ser 
65 70 75 80 

gcg ctg ggc gaa tac gtt gec gag cgc gat ctt ate ggc aag ate gac 288 
Ala Leu Gly Glu Tyr Val Ala Glu Arg Asp Leu He Gly Lys He Asp 
85 90 95 

ggc att etc etc gat ctt ggc gtc tct tea ccg caa ctt gat gat get 336 
Gly He Leu Leu Asp Leu Gly Val Ser Ser Pro Gin Leu Asp Asp Ala 
100 105 110 

gaa cgt ggc ttt tec ttt atg cgc gat ggt ccg ctg gac atg cgt atg 384 
Glu Arg Gly Phe Ser Phe Met Arg Asp Gly Pro Leu Asp Met Arg Met 
115 120 125 

gac eca acc cgt ggg cag tea gec get gaa tgg eta caa acc gca gaa 432 
Asp Pro Thr Arg Gly Gin Ser Ala Ala Glu Trp Leu Gin Thr Ala Glu 
130 135 140 

gaa gee gat ate gec tgg gta ttg aaa acc tat ggt gaa gag cgt ttt 480 
Glu Ala Asp He Ala Trp Val Leu Lys Thr Tyr Gly Glu Glu Arg Phe 
145 150 155 160 
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gcc aaa cgc att gcc cgc gcc att gtc gag cgt aac cgc gaa cag ccg 528 
Ala Lys Arg lie Ala Arg Ala lie Val Glu Arg Asn Arg Glu Gin Pro 
165 170 175 

atg acc cgc acc aaa gaa ctg gcg gaa gtc gtg get get gca acg ccg 576 
Met Thr Arg Thr Lys Glu Leu Ala Glu Val Val Ala Ala Ala Thr Pro 
180 185 190 

gtg aaa gat aag ttt aaa cat ccc gcg acc cgt acc ttc cag gcg gtg 624 
Val Lys Asp Lys Phe Lys His Pro Ala Thr Arg Thr Phe Gin Ala Val 
195 200 205 

cgc att tgg gta aac agt gaa ctg gag gag ata gag cag gcg eta aaa 672 
Arg lie Trp Val Asn Ser Glu Leu Glu Glu lie Glu Gin Ala Leu Lys 
210 215 220 

age teg etc aac gtg ctg gcc ccg ggt ggg egg ctt teg ate ate age 720 
Ser Ser Leu Asn Val Leu Ala Pro Gly Gly Arg Leu Ser lie lie Ser 
225 230 235 240 

ttc cac teg ctg gaa gac cgt att gtg aaa cgt ttt atg cgt gaa aac 768 
Phe His Ser Leu Glu Asp Arg lie Val Lys Arg Phe Met Arg Glu Asn 
245 250 255 

age cgc ggt ccg caa gtt ccg gca ggg tta ccg atg act gaa gag cag 816 
Ser Arg Gly Pro Gin Val Pro Ala Gly Leu Pro Met Thr Glu Glu Gin 
260 265 270 

etc aaa aaa ctg ggt ggc cgt cag ctg cga gca eta ggc aag tta atg 864 
Leu Lys Lys Leu Gly Gly Arg Gin Leu Arg Ala Leu Gly Lys Leu Met 
275 280 285 

ccg ggc gaa gaa gag gtg get gag aac cct cgt gcc cgt agt tea gtt 912 
Pro Gly Glu Glu Glu Val Ala Glu Asn Pro Arg Ala Arg Ser Ser Val 
290 295 300 



ctg cgt att gca gag agg acg aat gca tga 942 
Leu Arg lie Ala Glu Arg Thr Asn Ala * 
305 310 



<210> 254 
<211> 366 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (366) 

<400> 254 

atg ate age aga gtg aca gaa get eta age aaa gtt aaa gga teg atg 48 
Met lie Ser Arg Val Thr Glu Ala Leu Ser Lys Val Lys Gly Ser Met 
1 5 10 15 



gga age cac gag cgc cat gca ttg 
Gly Ser His Glu Arg His Ala Leu 
20 

ttg cga ttt ggg aag ctg cca etc 
Leu Arg Phe Gly Lys Leu Pro Leu 



cct ggt gtt ate ggt gac gat ctt 96 
Pro Gly Val lie Gly Asp Asp Leu 
25 30 

tgc ctg ttc att tgc att att ttg 144 
Cys Leu Phe lie Cys He lie Leu 
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35 40 45 

acg gcg gtg act gtg gta acc acg gcg cac cat acc cgt tta ctg acc 192 
Thr Ala Val Thr Val Val Thr Thr Ala His His Thr Arg Leu Leu Thr 
50 55 60 

get cag cgc gaa caa ctg gtg ctg gag cga gat get tta gac att gaa 240 
Ala Gin Arg Glu Gin Leu Val Leu Glu Arg Asp Ala Leu Asp lie Glu 
65 70 75 80 

tgg cgc aac ctg ate ctt gaa gag aat gcg etc ggc gac cat age egg 288 
Trp Arg Asn Leu lie Leu Glu Glu Asn Ala Leu Gly Asp His Ser Arg 
B5 90 95 

gtg gaa agg ate gee acg gaa aag ctg caa atg cag cat gtt gat ccg 336 
Val Glu Arg lie Ala Thr Glu Lys Leu Gin Met Gin His Val Asp Pro 
100 105 110 

tea caa gaa aat ate gta gtg caa aaa taa 366 
Ser Gin Glu Asn lie Val Val Gin Lys * 
115 120 



<210> 255 
<211> 1767 
<212> DNA 

<213> Escherichia coli, 

<220> 

<221> CDS 

<222> (1) . - . (1767) 

<400> 255 

atg aaa gca gcg gcg aaa acg cag aaa cca aaa cgt cag gaa gaa cat 48 
Met Lys Ala Ala Ala Lys Thr Gin Lys Pro Lys Arg Gin Glu Glu His 
1 5 10 15 

gec aac ttt ate agt tgg cgt ttt gcg ttg tta tgc ggc tgt att etc 96 
Ala Asn Phe lie Ser Trp Arg Phe Ala Leu Leu Cys Gly Cys lie Leu 
20 25 30 

ctg gcg ctg get ttt ctg etc gga cgc gta gcg tgg tta caa gtt ate 144 
Leu Ala Leu Ala Phe Leu Leu Gly Arg Val Ala Trp Leu Gin Val lie 
35 40 45 

tec ccg gat atg ctg gtg aaa gag ggc gac atg cgt tet ctt cgc gtt 192 
Ser Pro Asp Met Leu Val Lys Glu Gly Asp Met Arg Ser Leu Arg Val 
50 55 60 

cag caa gtt tec acc tec cgc ggc atg att act gac cgt tct ggt cgc 240 
Gin Gin Val Ser Thr Ser Arg Gly Met lie Thr Asp Arg Ser Gly Arg 
65 70 75 80 

ccg tta gcg gtg age gtg ccg gta aaa gcg att tgg get gac ccg aaa 288 
Pro Leu Ala Val Ser Val Pro Val Lys Ala lie Trp Ala Asp Pro Lys 
85 90 " 95 

gaa gtg cat gac get ggc ggt ate age gtc ggt gac cgc tgg aag gcg 336 
Glu Val His Asp Ala Gly Gly lie Ser Val Gly Asp Arg Trp Lys Ala 
IQO 105 110 
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ctg get aac gcg etc aat att ccg ctg gat cag ctt tea gec cgc att 384 
Leu Ala Asn Ala Leu Asn He Pro Leu Asp Gin Leu Ser Ala Arg He 
115 120 125 

aac gee aac ccg aaa ggg cgc ttt att tat ctg gcg cgt cag gtg aac 432 
Asn Ala Asn Pro Lys Gly Arg Phe He Tyr Leu Ala Arg Gin Val Asn 
130 135 140 

cct gac atg gcg gac tac ate aaa aaa ctg aaa ctg ccg ggg att cat 480 
Pro Asp Met Ala Asp Tyr He Lys Lys Leu Lys \^eu Pro Gly He His 
145 150 155 160 

ctg cgt gaa gag tct cgc cgt tac tat ccg tec ggc gaa gtg act get 528 
Leu Arg Glu Glu Ser Arg Arg Tyr Tyr Pro Ser Gly Glu Val Thr Ala 
165 170 175 

cac etc ate ggc ttt act aac gtc gat agt caa ggg att gag ggc gtt 576 
His Leu He Gly Phe Thr Asn Val Asp Ser Gin Gly He Glu Gly Val 
180 185 190 

gag aag agt ttc gat aaa tgg ctt ace ggg cag ccg ggt gag cgc att 624 
Glu Lys Ser Phe Asp Lys Trp Leu Thr Gly Gin Pro Gly Glu Arg He 
195 200 205 

gtg cgt aaa gac cgc tat ggt cgc gta att gaa gat att tct tct act 672 
Val Arg Lys Asp Arg Tyr Gly Arg Val He Glu Asp He Ser Ser Thr 
210 215 220 

gac age cag gca gcg cac aac ctg gcg ctg agt att gat gaa cgc ctg 720 
Asp Ser Gin Ala Ala His Asn Leu Ala Leu Ser He Asp Glu Arg Leu 
225 230 235 240 

cag gcg ctg gtt tat cgc gaa ctg aac aac gcg gtg gee ttt aac aag 768 
Gin Ala Leu Val Tyr Arg Glu Leu Asn Asn Ala Val Ala Phe Asn Lys 
245 250 255 

get gaa tct ggt age gee gtg ctg gtg gat gtc aac acc ggt gaa gtg 816 
Ala Glu Ser Gly Ser Ala Val Leu Val Asp Val Asn Thr Gly Glu Val 
260 265 270 

ctg gcg atg get aac age ccg tea tac aac cct aac aat ctg age ggc 864 
Leu Ala Met Ala Asn Ser Pro Ser Tyr Asn Pro Asn Asn Leu Ser Gly 
275 280 285 

acg ccg aaa gag gcg atg cgt aac cgt acc ate acc gac gtg ttt gaa 912 
Thr Pro Lys Glu Ala Met Arg Asn Arg Thr He Thr Asp Val Phe Glu 
290 295 300 

ccg ggc tea acg gtt aaa ccg atg gtg gta atg acc gcg ttg caa cgt 960 
Pro Gly Ser Thr Val Lys Pro Met Val Val Met Thr Ala Leu Gin Arg 
305 310 315 320 

ggc gtg gtg egg gaa aac teg gta etc aat acc att cct tat cga att 1008 
Gly Val Val Arg Glu Asn Ser Val Leu Asn Thr He Pro Tyr Arg He 
325 330 335 

aac ggc cac gaa ate aaa gac gtg gca cgc tac age gaa tta acc ctg 1056 
Asn Gly His Glu He Lys Asp Val Ala Arg Tyr Ser Glu Leu Thr Leu 
340 345 350 

acc ggg gta. tta cag aag teg agt aac gtc ggt gtt tec aag ctg gcg 1104 
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Thr Gly Val Leu Gin Lys Ser Ser Asn Val Gly Val Ser Lys Leu Ala 
355 360 365 

tta gcg atg ccg tec tea gcg tta gta gat act tac tea cgt ttt gga 1152 
Leu Ala Met Pro Ser Ser Ala Leu Val Asp Thr Tyr Ser Arg Phe Gly 
370 375 380 

ctg gga aaa gcg acc aat ttg ggg ttg gtc gga gaa cgc agt ggc tta 1200 
Leu Gly Lys Ala Thr Asn Leu Gly Leu Val Gly Glu Arg Ser Gly Leu 
385 390 395 400 

tat cct caa aaa caa egg tgg tct gac ata gag agg gee acc ttc tct 1248 
Tyr Pro Gin Lys Gin Arg Trp Ser Asp lie Glu Arg Ala Thr Phe Ser 
405 410 • 415 

ttc ggc tac ggg eta atg gta aca cca tta cag tta gcg cga gtc tac 1296 
Phe Gly Tyr Gly Leu Met Val Thr Pro Leu Gin Leu Ala Arg Val Tyr 
420 425 430 

gca act ate ggc age tac ggc att tat cgc cca ctg teg att acc aaa 1344 
Ala Thr lie Gly Ser Tyr Gly lie Tyr Arg Pro Leu Ser lie Thr Lys 
435 440 445 

gtt gac ccc ccg gtt ccc ggt gaa cgt gtc ttc ccg gaa tec att gtc 1392 
Val Asp Pro Pro Val Pro Gly Glu Arg Val Phe Pro Glu Ser lie Val 
450 455 460 

cgc act gtg gtg cat atg atg gaa age gtg gcg eta cca ggc ggc ggc 1440 
Arg Thr Val Val His Met Met Glu Ser Val Ala Leu Pro Gly Gly Gly 
465 470 475 480 

ggc gtg aag gcg gcg att aaa ggc tat cgt ate gee att aaa acc ggt 1488 
Gly Val Lys Ala Ala lie Lys Gly Tyr Arg lie Ala lie Lys Thr Gly 
485 490 495 

acc gcg aaa aag gtc ggg ccg gac ggt cgc tac ate aat aaa tat att 1536 
Thr Ala Lys Lys Val Gly Pro Asp Gly Arg Tyr lie Asn Lys Tyr lie 
500 505 510 

get tat acc gca ggc gtt gcg cct gcg agt cag ccg cgc ttc gcg ctg 1584 
Ala Tyr Thr Ala Gly Val Ala Pro Ala Ser Gin Pro Arg Phe Ala Leu 
515 520 525 

gtt gtt gtt ate aac gat ccg cag gcg ggt aaa tac tac ggc ggc gee 1632 
Val Val Val lie Asn Asp Pro Gin Ala Gly Lys Tyr Tyr Gly Gly Ala 
530 535 540 

gtt tec gcg ccg gtc ttt ggt gec ate atg ggc ggc gta ttg cgt acc 1680 
Val Ser Ala Pro Val Phe Gly Ala lie Met Gly Gly Val Leu Arg Thr 
545 550 555 560 

atg aac ate gag ccg gat gcg ctg aca acg ggc gat aaa aat gaa ttt 1728 
Met Asn lie Glu Pro Asp Ala Leu Thr Thr Gly Asp Lys Asn Glu Phe 
565 570 575 

» 

gtg att aat caa ggc gag ggg aca ggt ggc aga teg taa 1767 
Val lie Asn Gin Gly Glu Gly Thr Gly Gly Arg Ser * 
580 585 
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<211> 1488 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(1488) 

<400> 256 

gtg gca gat cgt aat ttg cgc gac ctt ctt get ccg tgg gtg cca gac 48 
Met Ala Asp Arg Asn Leu Arg Asp Leu Leu Ala Pro Trp Val Pro Asp 
15 10 15 

gca cct teg cga gca ctg cga gag atg aca etc gac age cgt gtg get 96 
Ala Pro Ser Arg Ala Leu Arg Glu Met Thr Leu Asp Ser Arg Val Ala 
20 25 30 

gcg gcg ggc gat etc ttt gta get gta gta ggt cat cag gcg gac ggg 144 
Ala Ala Gly Asp Leu Phe Val Ala Val Val Gly His Gin Ala Asp Gly 
35 40 45 

cgt cga tat ate ccg cag gcg ata gcg caa ggt gtg get gee att att 192 
Arg Arg Tyr lie Pro Gin Ala lie Ala Gin Gly Val Ala Ala He He 
50 55 60 

gca gag gcg aaa gat gag gcg acc gat ggt gaa ate cgt gaa atg cac 240 
Ala Glu Ala Lys Asp Glu Ala Thr Asp Gly Glu He Arg Glu Met His 
65 70 75 80 

ggc gta ccg gtc ate tat etc age cag etc aac gag cgt tta tct gca 288 
Gly Val Pro Val He Tyr Leu Ser Gin Leu Asn Glu Arg Leu Ser Ala 
85 90 95 

ctg gcg ggc cgc ttt tac cat gaa ccc tct gac aat tta cgt etc gtg 336 
Leu Ala Gly Arg Phe Tyr His Glu Pro Ser Asp Asn Leu Arg Leu Val 
100 105 110 

ggc gta acg ggc acc aac ggc aaa acc acg act acc cag ctg ttg gcg 384 
Gly Val Thr Gly Thr Asn Gly Lys Thr Thr Thr Thr Gin Leu Leu Ala 
115 120 125 

ca 9 ^9 a 9 c caa ct 9 ctt 9ST C g aa atc a g c gcg gt a a tg ggc acc gtt 432 
Gin Trp Ser Gin Leu Leu Gly Glu He Ser Ala Val Met Gly Thr Val 
130 135 140 

ggt aac ggc ctg ctg ggg aaa gtg atc ccg aca gaa aat aca acc ggt 480 
Gly Asn Gly Leu Leu Gly Lys Val He Pro Thr Glu Asn Thr Thr Gly 
145 150 155 160 

teg gca gtc gat gtt cag cat gag ctg gcg ggg ctg gtg gat cag ggc 528 
Ser Ala Val Asp Val Gin His Glu Leu Ala Gly Leu Val Asp Gin Gly 
165 170 175 

gcg acg ttt tgc gca atg gaa gtt tec tec cac ggg ctg gta cag cac 576 
Ala Thr Phe Cys Ala Met Glu Val Ser Ser His Gly Leu Val Gin His 
180 185 190 

cgt gtg gcg gca ttg aaa ttt gcg gcg teg gtc ttt acc aac tta age 624 
Arg- Val Ala Ala Leu Lys Phe Ala Ala Ser Val Phe Thr Asn Leu Ser 
195 200 205 
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cgc gat cac ctt gat tat cat ggt gat atg gaa cac tac gaa gcc gcg 672 
Arg Asp His Leu Asp Tyr His Gly Asp Met Glu His Tyr Glu Ala Ala 
210 215 220 

aaa tgg ctg ctt tat tct gag cat cat tgc ggt cag gcg att att aac 720 
Lys Trp Leu Leu Tyr Ser Glu His His Cys Gly Gin Ala lie lie Asn 
225 230 235 240 

gcc gac gat gaa gtg ggc cgc cgc tgg ctg gca aaa ctg ccg gac gcg 768 
Ala Asp Asp Glu Val Gly Arg Arg Trp Leu Ala Lys Leu Pro Asp Ala 
245 250 255 

\ 

gtt gcg gta tea atg gaa gat cat att aat ccg aac tgt cac gga cgc 816 
Val Ala Val Ser Met Glu Asp His He Asn Pro Asn Cys His Gly Arg 
260 265 270 

tgg ttg aaa gcg acc gaa gtg aac tat cac gac age ggt gcg acg att 864 
Trp Leu Lys Ala Thr Glu Val Asn Tyr His Asp Ser Gly Ala Thr He 
275 280 285 

cgc ttt age tea agt tgg ggc gat ggc gaa att gaa age cat ctg atg 912 
Arg Phe Ser Ser Ser Trp Gly Asp Gly Glu He Glu Ser His Leu Met 
290 295 300 

ggc get ttt aac gtc age aac ctg ctg etc gcg ctg gcg aca ctg ttg 960 
Gly Ala Phe Asn Val Ser Asn Leu Leu Leu Ala Leu Ala Thr lieu Leu 
305 310 315 320 

gca etc ggc tat cca ctg get gat ctg ctg aaa acc gcc gcg cgt ctg 1008 
Ala Leu Gly Tyr Pro Leu Ala Asp Leu Leu Lys Thr Ala Ala Arg Leu 
325 330 335 

caa ccg gtt tgc gga cgt atg gaa gtg ttc act gcg cca ggc aaa ccg 1056 
Gin Pro Val Cys Gly Arg Met Glu Val Phe Thr Ala Pro Gly Lys Pro 
340 345 350 

acg gtg gtg gtg gat tac gcg cat acg ccg gat gca ctg gaa aaa gcc 1104 
Thr Val Val Val Asp Tyr Ala His Thr Pro Asp Ala Leu Glu Lys Ala 
355 360 365 

tta cag gcg gcg cgt ctg cac tgt gcg ggc aag ctg tgg tgt gtc ttt 1152 
Leu Gin Ala Ala Arg Leu His Cys Ala Gly Lys Leu Trp Cys Val Phe 
370 375 380 

ggc tgt ggt ggc gat cgc gat aaa ggt aag cgt cca ctg atg ggc gca 1200 
Gly Cys Gly Gly Asp Arg Asp Lys Gly Lys Arg Pro Leu Met Gly Ala 
385 390 395 400 

att gcc gaa gag ttt get gac gtg gcg gtg gtg acg gac gat aac ccg 1248 
He Ala Glu Glu Phe Ala Asp Val Ala Val Val Thr Asp Asp Asn Pro 
405 410 415 

cgt acc gaa gaa ccg cgt gcc ate ate aac gat att ctg gcg gga atg 1296 
Arg Thr Glu Glu Pro Arg Ala He He Asn Asp He Leu Ala Gly Met 
420 425 430 

tta gat gcc gga cat gcc aaa gtg atg gaa ggc cgt get gaa gcg gtg 1344 
Leu Asp Ala Gly His Ala Lys Val Met Glu Gly Arg Ala Glu Ala Val 
435 440 445 

act tgc gcc gtt atg cag get aaa gag aat gat gtg gta ctg gtc gcg 1392 
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Thr Cys Ala Val Met Gin Ala Lys Glu Asn Asp Val Val Leu Val Ala 
450 455 460 

ggc aaa ggc cat gaa gat tac cag att gtt ggc aat cag cgt ctg gac 1440 
Gly Lys Gly His Glu Asp Tyr Gin lie Val Gly Asn Gin Arg Leu Asp 
465 470 475 480 

tac tec gat cgc gtc acg gtg gcg cgt ctg ctg ggg gtg att gca tga 1488 
Tyr Ser Asp Arg Val Thr Val Ala Arg Leu Leu Gly Val lie Ala * 
485 490 495 



<210> 257 
<211> 1359 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) ... (1359) 

<400'> 257 

atg att age gta acc ctt age caa ctt acc gac att etc aac ggt gaa 48 
Met lie Ser Val Thr Leu Ser Gin Leu Thr Asp He Leu Asn Gly Glu 
15 10 15 

ctg caa ggt gca gat ate acc ctt gat get gta acc act gat acc cga 96 
Leu Gin Gly Ala Asp lie Thr Leu Asp Ala Val Thr Thr Asp Thr Arg 
20 25 30 

aaa ctg acg ccg ggc tgc ctg ttt gtt gec ctg aaa ggc gaa cgt ttt 144 
Lys Leu Thr Pro Gly Cys Leu Phe Val Ala Leu Lys Gly Glu Arg Phe 
35 40 45 

gat gec cac gat ttt gec gac cag gcg aaa get ggc ggc gca ggc gca 192 
Asp Ala Hia Asp Phe Ala Asp Gin Ala Lys Ala Gly Gly Ala Gly Ala 
50 55 60 

eta ctg gtt age cgt ccg ctg gac ate gac ctg ccg cag tta ate gtc 240 
Leu Leu Val Ser Arg Pro Leu Asp He Asp Leu Pro Gin Leu He Val 
65 70 75 80 

aag gat acg cgt ctg gcg ttt ggt gaa ctg get gca tgg gtt cgc cag 288 
Lys Asp Thr Arg Leu Ala Phe Gly Glu Leu Ala Ala Trp Val Arg Gin 
85 90 95 

caa gtt ccg gcg cgc gtg gtt get ctg acg ggg tec tec ggc aaa acc 336 
Gin Val Pro Ala Arg Val Val Ala Leu Thr Gly Ser Ser Gly Lys Thr 
100 105 110 

tec gtt aaa gag atg acg gcg gcg att tta age cag tgc ggc aac acg 384 
Ser Val Lys Glu Met Thr Ala Ala He Leu Ser Gin Cys Gly Asn Thr 
115 m 120 125 

ctt tat acg gca ggc aat etc aac aac gac ate ggt gta ccg atg acg 432 
Leu Tyr Thr Ala Gly Asn Leu Asn Asn Asp He Gly Val Pro Met Thr 
130 135 140 

ctg ttg cgc tta acg ccg gaa tac gat tac gca gtt att gaa ctt ggc 480 
Leu Leu Arg Leu Thr Pro Glu Tyr Asp Tyr Ala Val He Glu Leu Gly 
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145 150 155 160 

gcg aac cat cag ggc gaa ata gcc tgg act gtg agt ctg act cgc ccg 528 
Ala Asn His Gin Gly Glu lie Ala Trp Thr Val Ser Leu Thr Arg Pro 
165 170 175 

gaa get gcg ctg gtc aac aac ctg gca gcg gcg cat ctg gaa ggt ttt 576 
Glu Ala Ala Leu Val Asn Asn Leu Ala Ala Ala His Leu Glu Gly Phe 
180 185 190 

ggc teg ctt gcg ggt gtc gcg aaa gcg aaa ggt gaa ate ttt age ggc 624 
Gly Ser Leu Ala Gly Val Ala Lys Ala Lys Gly Glu lie Phe Ser Gly 
195 200 205 

ctg ccg gaa aac ggt ate gcc att atg aac gcc gac aac aac gac tgg 672 
Leu Pro Glu Asn Gly lie Ala lie Met Asn Ala Asp Asn Asn Asp Trp 
210 215 220 

ctg aac tgg cag age gta att ggc tea cgc aaa gtg tgg cgt ttc tea 720 
Leu Asn Trp Gin Ser Val lie Gly Ser Arg Lys Val Trp Arg Phe Ser 
225 230 235 240 

ccc aat gcc gcc aac age gat ttc ace gcc ace aat ate cat gtg ace 768 
Pro Asn Ala Ala Asn Ser Asp Phe Thr Ala Thr Asn lie His Val Thr 
245 250 255 

teg cac ggt acg gaa ttt acc eta caa ace cca acc ggt age gtc gat 816 
Ser His Gly Thr Glu Phe Thr Leu Gin Thr Pro Thr Gly Ser Val Asp 
260 265 270 

gtt ctg ctg ccg ttg ccg ggg cgt cac aat att gcg aat gcg ctg gca . 864 
Val Leu Leu Pro Leu Pro Gly Arg His Asn lie Ala Asn Ala Leu Ala 
275 280 285 

gcc get gcg etc tec atg tec gtg ggc gca acg ctt gat get ate aaa 912 
Ala Ala Ala Leu Ser Met Ser Val Gly Ala Thr Leu Asp Ala lie Lys 
290 295 300 

gcg ggg ctg gca aat ctg aaa get gtt cca ggc cgt ctg ttc ccc ate 960 
Ala Gly Leu Ala Asn Leu Lys Ala Val Pro Gly Arg Leu Phe Pro lie 
305 310 315 320 

caa ctg gca gaa aac cag ttg ctg etc gac gac tec tac aac gcc aat 1008 
Gin Leu Ala Glu Asn Gin Leu Leu Leu Asp Asp Ser Tyr Asn Ala Asn 
325 330 335 

gtc ggt tea atg act gca gca gtc cag gta ctg get gaa atg ccg ggc 1056 
Val Gly Ser Met Thr Ala Ala Val Gin Val Leu Ala Glu Met Pro Gly 
340 345 350 

tac cgc gtg ctg gtg gtg ggc gat atg gcg gaa ctg ggc get gaa age 1104 
Tyr Arg Val Leu Val Val Gly Asp Met Ala Glu Leu Gly Ala Glu Ser 
355 360 365 

gaa gcc tgc cat gta cag gtg ggc gag gcg gca aaa get get ggt att 1152 
Glu Ala Cys His Val Gin Val Gly Glu Ala Ala Lys Ala Ala Gly lie 
370 375 380 

gac cgc gtg tta age gtg ggt aaa caa age cat get ate age acc gcc 1200 
Asp Arg Val Leu Ser Val Gly Lys Gin Ser His Ala lie Ser Thr Ala 
385 390 395 400 
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age ggc gtt ggc gaa cat ttt get gat aaa act gcg tta att acg cgt 1248 
Ser Gly Val Gly Glu His Phe Ala Asp Lys Thr Ala Leu He Thr Arg 
405 410 415 

ctt aaa tta ctg att get gag caa cag gta att acg att tta gtt aag 1296 
Leu Lys Leu Leu He Ala Glu Gin Gin Val He Thr He Leu Val Lys 
420 425 430 

ggt tea cgt agt gee gee atg gaa gag gta gta cgc get tta cag gag 1344 
Gly Ser Arg Ser Ala Ala Met Glu Glu Val Val Arg Ala Leu Gin Glu 
435 440 445 

aat ggg aca tgt tag 1359 
Asn Gly Thr Cys * 
450 



<210> 258 
<211> 1083 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(1083) 

<400> 258 

atg tta gtt tgg ctg gee gaa cat ttg gtc aaa tat tat tec ggc ttt 48 
Met Leu Val Trp Leu Ala Glu His Leu Val Lys Tyr Tyr Ser Gly Phe 
15 10 15 

aac gtc ttt tec tat ctg acg ttt cgc gee ate gtc age ctg ctg acc 96 
Asn Val Phe Ser Tyr Leu Thr Phe Arg Ala He Val Ser Leu Leu Thr 
20 25 30 

gcg ctg ttc ate tea ttg tgg atg ggc ccg cgt atg att get cat ttg 144 
Ala Leu Phe He Ser Leu Trp Met Gly Pro Arg Met He Ala His Leu 
35 40 45 

caa aaa ctt tec ttt ggt cag gtg gtg cgt aac gac ggt cct gaa tea 192 
Gin Lys Leu Ser Phe Gly Gin Val Val Arg Asn Asp Gly Pro Glu Ser 
50 55 60 

cac ttc age aag cgc ggt acg ccg acc atg ggc ggg att atg ate ctg 240 
His Phe Ser Lys Arg Gly Thr Pro Thr Met Gly Gly He Met He Leu 
65 70 75 80 

acg gcg att gtg ate tec gta ctg ctg tgg get tac ccg tec aat ccg 288 
Thr Ala He Val He Ser Val Leu Leu Trp Ala Tyr Pro Ser Asn Pro 
85 90 95 

tac gtc tgg tgc gtg ttg gtg gtg ctg gta ggt tac ggt gtt att ggc 336 
Tyr Val Trp Cys Val Leu Val Val Leu Val Gly Tyr Gly Val He Gly 
100 105 110 

ttt gtt gat gat tat cgc aaa gtg gtg cgt aaa gac acc aaa ggg ttg 384 
Phe Val Asp Asp Tyr Arg Lys Val Val Arg Lys Asp Thr Lys Gly Leu 
115 120 125 

ate get cgt tgg aag tat ttc tgg atg teg gtc att gcg ctg ggt gtc 432 
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lie Ala Arg Trp Lys Tyr Phe Trp Met Ser Val He Ala Leu Gly Val 
130 135 140 

gcc ttc gcc ctg tac ctt gcc ggc aaa gac acg ccc gca acg cag ctg 480 
Ala Phe Ala Leu Tyr Leu Ala Gly Lys Asp Thr Pro Ala Thr Gin Leu 
145 150 155 160 

gtg gtc cca ttc ttt aaa gat gtg atg ccg cag ctg ggg ctg ttc tac 528 
Val Val Pro Phe Phe Lys Asp Val Met Pro Gin Leu Gly Leu Phe Tyr 
165 170 175 

att ctg ctg get tac ttc gtc att gtg ggt act ggc aac gcg gta aac 576 
He Leu Leu Ala Tyr Phe Val He Val Gly Thr Gly Asn Ala Val Asn 
180 185 190 

ctg acc gat ggt etc gac ggc ctg gca att atg ccg acc gta ttt gtc 624 
Leu Thr Asp Gly Leu Asp Gly Leu Ala He Met Pro Thr Val Phe Val 
195 200 205 

gcc ggt ggt ttt gcg ctg gtg gcg tgg gcg acc ggc aat atg aac ttt 672 
Ala Gly Gly Phe Ala Leu Val Ala Trp Ala Thr Gly Asn Met Asn Phe 
210 215 220 

gcc age tac ttg cat ata ccg tat ctg cga cac gcc ggg gaa ctg gtt 720 
Ala Ser Tyr Leu His He Pro Tyr Leu Arg His Ala Gly Glu Leu Val 
225 230 235 240 

att gtc tgt acc gcg ata gtc ggg gca gga ctg ggc ttc ctg tgg ttt 768 
He Val Cys Thr Ala He Val Gly Ala Gly Leu Gly Phe Leu Trp Phe 
245 250 255 

aac acc tat ccg gcg cag gtc ttt atg ggc gat gta ggt teg ctg gcg 816 
Asn Thr Tyr Pro Ala Gin Val Phe Met Gly Asp Val Gly Ser Leu Ala 
260 265 270 

tta ggt ggt gcg tta ggc att ate gcc gta ctg eta cgt cag gaa ttc 864 
Leu Gly Gly Ala Leu Gly He He Ala Val Leu Leu Arg Gin Glu Phe 
275 280 285 



ctg ctg gtg att atg ggg ggc gtg ttc gtg gta gaa acg ctt tct gtc 912 
Leu Leu Val He Met Gly Gly Val Phe Val Val Glu Thr Leu Ser Val 
290 295 300 

ate ctg cag gtc ggc tec ttt aaa ctg cgc gga caa cgt att ttc cgc 960 
He Leu Gin Val Gly Ser Phe Lys Leu Arg Gly Gin Arg He Phe Arg 
305 310 315 320 

atg gca ccg att cat cac cac tat gaa ctg aaa ggc tgg ccg gaa ccg 1008 
Met Ala Pro He His His His Tyr Glu Leu Lys Gly Trp Pro Glu Pro 
325 330 335 

cgc gtc att gtg cgt ttc tgg att att teg ctg atg ctg gtt ctg att 1056 
Arg Val He Val Arg Phe Trp He He Ser Leu Met Leu Val Leu He 
340 345 350 

ggt ctg gca acg ctg aag gta cgt taa 1083 
Gly Leu Ala Thr Leu Lys Val Arg * 
355 360 



<210> 259 
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<211> 1317 
<212> UNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) „ . . (1317) 

<400> 259 

atg get gat tat cag ggt aaa aat gtc gtc att ate ggc ctg ggc etc 48 
Met Ala Asp Tyr Gin Gly Lys Asn Val Val lie lie Gly Leu Gly Leu 
15 10 15 

acc ggg ctt tec tgc gtg gac ttt ttc etc get cgc ggt gtg acg ccg 96 
Thr Gly Leu Ser Cys Val Asp Phe Phe Leu Ala Arg Gly Val Thr Pro 
20 25 30 

cgc gtt atg gat acg cgt atg aca ccg cct ggc ctg gat aaa tta ccc 144 
Arg Val Met Asp Thr Arg Met Thr Pro Pro Gly Leu Asp Lys Leu Pro 
35 40 45 

gaa gee gta gaa cgc cac acg ggc agt ctg aat gat gaa tgg ctg atg 192 
Glu Ala Val Glu Arg His Thr Gly Ser Leu Asn Asp Glu Trp Leu Met 
50 55 60 

gcg gca gat ctg att gtc gec agt ccc ggt att gca ctg gcg cat cca 240 
Ala Ala Asp Leu He Val Ala Ser Pro Gly He Ala Leu Ala His Pro 
65 70 75 80 

tec tta age get gee get gat gee gga ate gaa ate gtt ggc gat ate 288 
Ser Leu Ser Ala Ala Ala Asp Ala Gly He Glu He Val Gly Asp He 
85 90 95 

gag ctg ttc tgt cgc gaa gca caa gca ccg att gtg gcg att acc ggt 336 
Glu Leu Phe Cys Arg Glu Ala Gin Ala Pro He Val Ala He Thr Gly 
100 105 110 

tct aac ggc aaa age acg gtc acc acg eta gtg ggt gaa atg gcg aaa 384 
Ser Asn Gly Lys Ser Thr Val Thr Thr Leu Val Gly Glu Met Ala Lys 
115 120 125 

gcg gcg ggg gtt aac gtt ggt gtg ggt ggc aat att ggc ctg cct gcg 432 
Ala Ala Gly Val Asn Val Gly Val Gly Gly Asn He Gly Leu Pro Ala 
130 135 140 

ttg atg eta ctg gat gat gag tgt gaa ctg tac gtg ctg gaa ctg teg 480 
Leu Met Leu Leu Asp Asp Glu Cys Glu Leu Tyr Val Leu Glu Leu Ser 
145 150 155 160 

age ttc cag ctg gaa acc acc tec age tta cag gcg gta gca gcg acc 52 B 
Ser Phe Gin Leu Glu Thr Thr Ser Ser Leu Gin Ala Val Ala Ala Thr 
165 170 175 

att ctg aac gtg act gaa gat cat atg gat cgc tat ccg ttt ggt tta 576 
He Leu Asn Val Thr Glu Asp His Met Asp Arg Tyr Pro Phe Gly Leu 
180 185 190 

caa cag tat cgt gca gca aaa ctg cgc att tac gaa aac gcg aaa gtt 624 
Gin Gin Tyr Arg Ala Ala Lys Leu Arg He Tyr Glu Asn Ala Lys Val 
195 200 205 
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tgc gtg gtt aat get gat gat gec tta aca atg ccg att cgc ggt gcg 672 
Cys Val Val Asn Ala Asp Asp Ala Leu Thr Met Pro lie Arg Gly Ala 
210 215 220 

gat gaa cgc tgc gtc age ttt ggc gtc aac atg ggt gac tat cac ctg 720 
Asp Glu Arg Cys Val Ser Phe Gly Val Asn Met Gly Asp Tyr His Leu 
225 230 235 240 

aat cat cag cag ggc gaa acc tgg ctg egg gtt aaa ggc gag aaa gtg 768 
Asn His Gin Gin Gly Glu Thr Trp Leu Arg Val Lys Gly Glu Lys Val 
245 250 255 

ctg aat gtg aaa gag atg aaa ctt tec ggg cag cat aac tac acc aat 816 
Leu Asn Val Lys Glu Met Lys Leu Ser Gly Gin His Asn Tyr Thr Asn 
260 265 270 

gcg ctg gcg gcg ctg gcg ctg gca gat get gca ggg tta ccg cgt gee 864 
Ala Leu Ala Ala Leu Ala Leu Ala Asp Ala Ala Gly Leu Pro Arg Ala 
275 280 285 

age age ctg aaa gcg tta acc aca ttc act ggt ctg ccg cat cgc ttt 912 
Ser Ser Leu Lys Ala Leu Thr Thr Phe .Thr Gly Leu Pro His Arg Phe 
290 295 300 

gaa gtt gtg ctg gag cat aac ggc gta cgt tgg att aac gat teg aaa 960 
Glu Val Val Leu Glu His Asn Gly Val Arg Trp He Asn Asp Ser Lys 
305 310 315 320 

gcg acc aac gtc ggc agt acg gaa gcg gcg ctg aat ggc ctg cac gta 1008 
Ala Thr Asn Val Gly Ser Thr Glu Ala Ala Leu Asn Gly Leu His Val 
325 330 335 

gac ggc aca ctg cat ttg ttg ctg ggt ggc gat ggt aaa teg gcg gac 1056 
Asp Gly Thr Leu His Leu Leu Leu Gly Gly Asp Gly Lys Ser Ala Asp 
340 345 350 

ttt age cca ctg gcg cgt tac ctg aat ggc gat aac gta cgt ctg tat 1104 
Phe Ser Pro Leu Ala Arg Tyr Leu Asn Gly Asp Asn Val Arg Leu Tyr 
355 360 365 

tgt ttc ggt cgt gac ggc gcg cag ctg gcg gcg eta cgc ccg gaa gtg 1152 
Cys Phe Gly Arg Asp Gly Ala Gin Leu Ala Ala Leu Arg Pro Glu Val 
370 375 380 

gca gaa caa acc gaa act atg gaa cag gcg atg cgc ttg ctg get ccg 1200 
Ala Glu Gin Thr Glu Thr Met Glu Gin Ala Met Arg Leu Leu Ala Pro 
385 390 395 400 

cgt gtt cag ccg ggc gat atg gtt ctg etc tec cca gec tgt gec age 1248 
Arg Val Gin Pro Gly Asp Met Val Leu Leu Ser Pro Ala Cys Ala Ser 
405 410 415 

ctt gat cag ttc aag aac ttt gaa caa cga ggc aat gag ttt gec cgt 1296 
Leu Asp Gin Phe Lys Asn Phe Glu Gin Arg Gly Asn Glu Phe Ala Arg 
420 425 430 

ctg gcg aag gag tta ggt tga 1317 
Leu Ala Lys Glu Leu Gly * 
435 
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<210> 260 
<211> 1245 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1245) 

<400> 260 

atg cgt tta tct etc cct cgc ctg aaa atg ccg cgc ctg cca gga ttc 48 
Met Arg Leu Ser Leu Pro Arg Leu Lys Met Pro Arg Leu Pro Gly Phe 
15 10 15 

agt ate ctg gtc tgg ate tec acg gcg eta aag ggc tgg gtg atg ggc 96 
Ser lie Leu Val Trp He Ser Thr Ala Leu Lys Gly Tip Val Met Gly 
20 25 30 

teg egg gaa aaa gat ace gac age ctg ate atg tac gat cgc ace tta 144 
Ser Arg Glu Lys Asp Thr Asp Ser Leu He Met Tyr Asp Arg Thr Leu 
35 * 40 45 

ctg tgg ctg acc ttc ggc etc gcg gcg att ggc ttt ate atg gtg acc 192 
Leu Trp Leu Thr Phe Gly Leu Ala Ala He Gly Phe He Met Val Thr 
50 55 60 

teg gcg tea atg ccc ata ggg caa cgc tta acc aac gat ccg ttc ttc 240 
Ser Ala Ser Met Pro He Gly Gin Arg Leu Thr Asia Asp Pro Phe Phe 
65 70 75 80 

ttc gcg aag cgt gat ggt gtc tat ctg att ttg gcg ttt att ctg gcg 288 
Phe Ala Lys Arg Asp Gly Val Tyr Leu He Leu Ala Phe He Leu Ala 
85 90 95 

ate att acg ctg cgt ctg ccg atg gag ttc tgg caa cgc tac agt gee 336 
He He Thr Leu Arg Leu Pro Met Glu Phe Trp Gin Arg Tyr Ser Ala 
100 105 110 

acg atg ctg etc gga tct ate ate ctg ctg atg ate gtc ctg gta gtg 384 
Thr Met Leu Leu Gly Ser He He Leu Leu Met He Val Leu Val Val 
115 120 125 

ggt age teg gtt aaa ggg gca teg cgt tgg ate gat etc ggt ttg ctg 432 
Gly Ser Ser Val Lys Gly Ala Ser Arg Trp He Asp Leu Gly Leu Leu 
130 135 140 

cgt ate cag cct gcg gag ctg aca aaa ctg teg ctg ttt tgc tat ate 480 
Arg He Gin Pro Ala Glu Leu Thr Lys Leu Ser Leu Phe Cys Tyr He 
145 150 155 160 

gee aac tat ctg gtg cgt aaa ggc gac gaa gta cgt aat aac ctg cgc 528 
Ala Asn Tyr Leu Val Arg Lys Gly Asp Glu Val Arg Asn Asn Leu Arg 
165 170 175 

ggc ttc ctg aaa ccg atg ggc gtg att ctg gtg ttg gca gtg tta ctg 576 
Gly Phe Leu Lys Pro Met Gly Val He Leu Val Leu Ala Val Leu Leu 
180 185 190 

ctg gca cag cca gac ctt ggt acg gtg gtg gtg ttg ttt gtg act acg 624 
Leu Ala Gin Pro Asp Leu Gly Thr Val Val Val Leu Phe Val Thr Thr 
195 200 205 
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ctg gcg atg ttg ttc ctg gcg gga gcg aaa ttg tgg cag ttc att gcc 672 
Leu Ala Met Leu Phe Leu Ala Gly Ala Lys Leu Trp Gin Phe lie Ala 
210 215 220 

att ate ggt atg ggc att tea gcg gtt gtg ttg ctg ata etc gcc gaa 720 
lie lie Gly Met Gly He Ser Ala Val Val Leu Leu He Leu Ala Glu 
225 230 235 240 

ccg tac cgt ate cgc cgt gtt acc gca ttc tgg aac ccg tgg gaa gat 768 
Pro Tyr Arg He Arg Arg Val Thr Ala Phe Trp Asn Pro Trp Glu Asp 
245 250 255 

ccc ttt ggc age ggc tat cag tta acg caa teg ctg atg gcg ttt ggt 816 
Pro Phe Gly Ser Gly Tyr Gin Leu Thr Gin Ser Leu Met Ala Phe Gly 
260 265 270 

cgc ggc gaa ctt tgg ggg caa ggt tta ggt aac teg gta caa aaa ctg 864 
Arg Gly Glu Leu Trp Gly Gin Gly Leu Gly Asn Ser Val Gin Lys Leu 
275 280 285 

gag tat ctg ccg gaa gcg cac act gac ttt att ttc gcc att ate ggc 912 
Glu Tyr Leu Pro Glu Ala His Thr Asp Phe He Phe Ala He He Gly 
290 295 300 

gaa gaa ctg ggg tat gtc ggt gtg gtg ctg gca ctt tta atg gta ttc 960 
Glu Glu Leu Gly Tyr Val Gly Val Val Leu Ala Leu Leu Met Val Phe 
305 310 315 320 

ttc gtc get ttt cgc gcg atg teg att ggc cgt aaa gca tta gaa att 1008 
Phe Val Ala Phe Arg Ala Met Ser He Gly Arg Lys Ala Leu Glu He 
325 330 335 

gac cac cgt ttt tec ggt ttt etc gcc tgt tct att ggc ate tgg ttt 1056 
Asp His Arg Phe Ser Gly Phe Leu Ala Cys Ser He Gly He Trp Phe 
340 345 350 

age ttc cag gcg ctg gtt aac gta ggc gcg gcg gcg ggg atg tta ccg 1104 
Ser Phe Gin Ala Leu Val Asn Val Gly Ala Ala Ala Gly Met Leu Pro 
355 360 365 

acc aaa ggt ctg aca ttg ccg ctg ate agt tac ggt ggt teg age tta 1152 
Thr Lys Gly Leu Thr Leu Pro Leu He Ser Tyr Gly Gly Ser Ser Leu 
370 375 380 

ctg att atg teg aca gcc ate atg atg ctg ttg cgt att gat tat gaa 1200 
Leu He Met Ser Thr Ala He Met Met Leu Leu Arg He Asp Tyr Glu 
385 390 395 400 

acg cgt ctg gag aaa gcg cag gcg ttt gta cga ggt tea cga tga 1245 
Thr Arg Leu Glu Lys Ala Gin Ala Phe Val Arg Gly Ser Arg * 
405 410 



<210> 261 
<211> 1068 
<212> DNA 

<213> Escherichia coli . 
<220> 
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<221> CDS 

<222> (1) . (1068) 

<400> 261 

atg agt ggt caa gga aag cga tta atg gtg atg gca ggc gga acc ggt 48 
Met Ser Gly Gin Gly I*ys Arg Leu Met Val Met Ala Gly Gly Thr Gly 
1 5 10 15 

gga cat gta ttc ccg gga ctg gcg gtt gcg cac cat eta atg get cag 96 
Gly His Val Phe Pro Gly Leu Ala Val Ala His His Leu Met Ala Gin 
20 25 30 

ggt tgg caa gtt cgc tgg ctg ggg act gec gac cgt atg gaa gcg gac 144 
Gly Trp Gin Val Arg Trp Leu Gly Thr Ala Asp Arg Met Glu Ala Asp 
35 40 45 

tta gtg cca aaa cat ggc ate gaa att gat ttc att cgt ate tct ggt 192 
Leu Val Pro Lys His Gly He Glu He Asp Phe He Arg He Ser Gly 
50 55 60 

ctg cgt gga aaa ggt at a aaa gca ctg ata get gee ccg ctg cgt ate 240 
Leu Arg Gly Lys Gly He Lys Ala Leu He Ala Ala Pro Leu Arg He 
65 70 75 80 

ttc aac gee tgg cgt cag gcg egg gcg att atg aaa gcg tac aaa cct 288 
Phe Asn Ala Trp Arg Gin Ala Arg Ala He Met Lys Ala Tyr Lys Pro 
85 90 95 

gac gtg gtg etc ggt atg gga ggc tac gtg tea ggt cca ggt ggt ctg 336 

Asp Val Val Leu Gly Met Gly Gly Tyr Val Ser Gly Pro Gly Gly Leu 
100 105 110 

\ 

gec gcg tgg teg tta ggc att ccg gtt gta ctt cat gaa caa aac ggt 384 

Ala Ala Trp Ser Leu Gly He Pro Val Val Leu His Glu Gin Asn Gly 

115 120 125 

att gcg ggc tta acc aat aaa tgg ctg gcg aag att gee acc aaa gtg 432 
He Ala Gly Leu Thr Asn Lys Trp Leu Ala Lys He Ala Thr Lys Val 
130 135 140 

atg cag gcg ttt cca ggt get ttc cct aat gcg gaa gta gtg ggt aac 480 
Met Gin Ala Phe Pro Gly Ala Phe Pro Asn Ala Glu Val Val Gly Asn 
145 150 155 160 

ccg gtg cgt acc gat gtg ttg gcg ctg ccg ttg ccg cag caa cgt ttg 528 
Pro Val Arg Thr Asp Val Leu Ala Leu Pro Leu Pro Gin Gin Arg Leu 
165 170 1,75 

get gga cgt gaa ggt ccg gtt cgt gtg ctg gta gtg ggt ggt tct cag 576 
Ala Gly Arg Glu Gly Pro Val Arg Val Leu Val Val Gly Gly Ser Gin 
180 185 190 

ggc gca cgc att ctt aac cag aca atg ccg cag gtt get gcg aaa ctg 624 
Gly Ala Arg He Leu Asn Gin Thr Met Pro Gin Val Ala Ala Lys Leu 
195 200 205 

ggt gat tea gtc act ate tgg cat cag age ggc aaa ggt teg caa caa 672 
Gly Asp Ser Val Thr He Trp His Gin Ser Gly Lys Gly Ser Gin Gin 
210 215 220 

tec gtt gaa cag gcg tat gee gaa gcg ggg caa ccg cag cat aaa gtg 720 
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Ser Val Glu Gin Ala Tyr Ala Glu Ala Gly Gin Pro Gin His Lys Val 
225 230 235 240 

acg gaa ttt att gat gat atg gcg gcg gcg tat gcg tgg gcg gat gtc 768 
Thr Glu Phe lie Asp Asp Met Ala Ala Ala Tyr Ala Trp Ala Asp Val 
245 250 255 

gtc gtt tgc cgc tec ggt gcg tta acg gtg agt gaa ate gec gcg gca 816 
Val Val Cys Arg Ser Gly Ala Leu Thr Val Ser Glu lie Ala Ala Ala 
260 265 270 

gga eta ccg gcg ttg ttt gtg ccg ttt caa cat aaa gac cgc cag caa 864 
Gly Leu Pro Ala Leu Phe Val Pro Phe Gin His Lys Asp Arg Gin Gin 
275 280 285 

tac tgg aat gcg eta ccg ctg gaa aaa gcg ggc gca gee aaa att ate 912 
Tyr Trp Asn Ala Leu Pro Leu Glu Lys Ala Gly Ala Ala Lys lie lie 
290 295 300 

gag cag cca cag ctt age gtg gat get gtc gee aac acc ctg gec ggg 960 
Glu Gin Pro Gin Leu Ser Val Asp Ala Val Ala Asn Thr Leu Ala Gly 
305 310 315 320 

tgg teg cga gaa acc tta tta acc atg gca gaa cgc gee cgc get gca 1008 
Trp Ser Arg Glu Thr Leu Leu Thr Met Ala Glu Arg Ala Arg Ala Ala 
325 330 335 

tec att ccg gat gee acc gag cga gtg gca aat gaa gtg age egg gtt 1056 
Ser lie Pro Asp Ala Thr Glu Arg Val Ala Asn Glu Val Ser Arg Val 
340 345 350 

gee egg gcg taa 1068 
Ala Arg Ala * 
355 



<210> 262 
<211> 1476 
<212> DMA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1476) 

<400> 262 

atg aat aca caa caa ttg gca aaa ctg cgt tec ate gtg ccc gaa atg 48 
Met Asn Thr Gin Gin Leu Ala Lys Leu Arg Ser lie Val Pro Glu Met 
1 5 ' ' 10 15 

cgt cgc gtt egg cac ata cat ttt gtc ggc att ggt ggt gec ggt atg 96 
Arg Arg Val Arg His lie His Phe Val Gly He Gly Gly Ala Gly Met 
20 25 30 

ggc ggt att gee gaa gtt ctg gee aat gaa ggt tat cag ate agt ggt 144 
Gly Gly He Ala Glu Val Leu Ala Asn Glu Gly Tyr Gin He Ser Gly 
35 40 45 

tec gat tta gcg cca aat ccg gtc acg cag cag tta atg aat ctg ggt 192 
Ser Asp Leu Ala Pro Asn Pro Val Thr Gin Gin Leu Met Asn Leu Gly 
50 55 60 
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gcg acg att tat ttc aac cat cgc ccg gaa aac gta cgt gat gcc age 240 
Ala Thr lie Tyr Phe Asn His Arg Pro Glu Asn Val Arg Asp Ala Ser 
65 70 75 80 

gtg gtc gtt gtt tec age gcg att tct gcc gat aac ccg gaa att gtc 288 
Val Val Val Val Ser Ser Ala lie Ser Ala Asp Asn Pro Glu lie Val 
85 90 95 

gcc get cat gaa gcg cgt att ccg gtg ate cgt cgt gcc gaa atg ctg 336 
Ala Ala His Glu Ala Arg He Pro Val lie Arg Arg Ala Glu Met Leu 
100 105 110 

get gag tta atg cgt ttt cgt cat ggc ate gcc att gcc gga acg cac 384 
Ala Glu Leu Met Arg Phe Arg His Gly lie Ala lie Ala Gly Thr His 
115 120 125 

ggc aaa acg aca acc acc gcg atg gtt tec age ate tac gca gaa gcg 432 
Gly Lys Thr Thr Thr Thr Ala Met Val Ser Ser He Tyr Ala Glu Ala 
130 135 140 

ggg etc gac cca acc ttc gtt aac ggc ggg ctg gta aaa gcg gcg ggg 480 
Gly Leu Asp Pro Thr Phe Val Asn Gly Gly Leu Val Lys Ala Ala Gly 
145 150 155 160 

gtt cat gcg cgt ttg ggg cat ggt egg tac ctg att gcc gaa gca gat 528 
Val His Ala Arg Leu Gly His Gly Arg Tyr Leu He Ala Glu Ala Asp 
165 170 175 

gag agt gat gca teg ttc ctg cat ctg caa ccg atg gtg gcg att gtc 576 
Glu Ser Asp Ala Ser Phe Leu His Leu Gin Pro Met Val Ala He Val 
180 185 190 

acc aat ate gaa gcc gac cac atg gat acc tac cag ggc gac ttt gag 624 
Thr Asn He Glu Ala Asp His Met Asp Thr Tyr Gin Gly Asp Phe Glu 
195 200 205 

aat tta aaa cag act ttt att aat ttt ctg cac aac ctg ccg ttt tac 672 
Asn Leu Lys Gin Thr Phe He Asn Phe Leu His Asn Leu Pro Phe Tyr 
210 215 220 

ggt cgt gcg gtg atg tgt gtt gat gat ccg gtg ate cgc gaa ttg tta 720 
Gly Arg Ala Val Met Cys Val Asp Asp Pro Val He Arg Glu Leu Leu 
225 230 235 240 

ccg cga gtg ggg cgt cag acc acg act tac ggc ttc age gaa gat gcc 768 
Pro Arg Val Gly Arg Gin Thr Thr Thr Tyr Gly Phe Ser Glu Asp Ala • 
245 250 255 

gac gtg cgt gta gaa gat tat cag cag att ggc ccg cag ggg cac ttt 816 
Asp Val Arg Val Glu Asp Tyr Gin Gin He Gly Pro Gin Gly His Phe 
260 265 270 

acg ctg ctg cgc cag gac aaa gag ccg atg cgc gtc acc ctg aat gcg 864 
Thr Leu Leu Arg Gin Asp Lys Glu Pro Met Arg Val Thr Leu Asn Ala 
275 280 ; 285 

cca ggt cgt cat aac gcg ctg aac gcc gca get gcg gtt gcg gtt get 912 
Pro Gly Arg His Asn Ala Leu Asn Ala Ala Ala Ala Val Ala Val Ala 
290 295 300 
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acg gaa gag ggc att gac gac gag get att ttg egg gcg ctt gaa age 960 
Thr Glu Glu Gly lie Asp Asp Glu Ala He Leu Arg Ala Leu Glu Ser 
305 310 315 320 

ttc cag ggg act ggt cgc cgt ttt gat ttc etc ggt gaa ttc ccg ctg 1008 
Phe Gin Gly Thr Gly Arg Arg Phe Asp Phe Leu Gly Glu Phe Pro Leu 
325 330 335 

gag cca gtg aat ggt aaa age ggt acg gca atg ctg gtc gat gac tac 1056 
Glu Pro Val Asn Gly Lys Ser Gly Thr Ala Met Leu Val Asp Asp Tyr 
340 345 350 

ggc cac cac ccg acg gaa gtg gac gec acc att aaa gcg gcg cgc gca 1104 
Gly His His Pro Thr Glu Val Asp Ala Thr He Lys Ala Ala Arg Ala 
355 360 365 

ggc tgg ccg gat aaa aac ctg gta atg ctg ttt cag ccg cac cgt ttt 1152 
Gly Trp Pro Asp Lys Asn Leu Val Met Leu Phe Gin Pro His Arg Phe 
370 375 380 

acc cgt acg cgc gac ctg tat gat gat ttc gec aat gtg ctg acg cag 1200 
Thr Arg Thr Arg Asp Leu Tyr Asp Asp Phe Ala Asn Val Leu Thr Gin 
385 390 395 400 

gtt gat acc ctg ttg atg ctg gaa gtg tat ccg get ggc gaa gcg cca 1248 
Val Asp Thr Leu Leu Met Leu Glu Val Tyr Pro Ala Gly Glu Ala Pro 
405 410 415 

att ccg gga gcg gac age cgt teg ctg tgt cgc aca att cgt gga cgt 1296 
He Pro Gly Ala Asp Ser Arg Ser Leu Cys Arg Thr He Arg Gly Arg 
420 425 430 

ggg aaa att gat ccc att ctg gtg ccg gat ccg gcg egg gta gec gag 1344 
Gly Lys He Asp Pro He Leu Val Pro Asp Pro Ala Arg Val Ala Glu 
435 440 445 

atg ctg gca ccg gta tta acc ggt aac gac ctg att etc gtt cag ggg 1392 
Met Leu Ala Pro Val Leu Thr Gly Asn Asp Leu He Leu Val Gin Gly 
450 455 460 

get ggt aat att gga aaa att gec cgt tct tta get gaa ate aaa ctg 1440 
Ala Gly Asn He Gly Lys He Ala Arg Ser Leu Ala Glu He Lys Leu 
465 470 475 480 

aag ccg caa act ccg gag gaa gaa caa cat gac tga 1476 
Lys Pro Gin Thr Pro Glu Glu Glu Gin His Asp * 
485 490 



<210> 263 
<211> 921 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (921) 

<400> 263 

atg act gat aaa ate gcg gtc ctg ttg ggt ggg acc tec get gag egg 48 
Met Thr Asp Lys He Ala Val Leu Leu Gly Gly Thr Ser Ala Glu Arg 
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15 10 15 

gaa gtt tct ctg aat tct ggc gca gcg gtg tta gcc gga ctg cgt gaa 96 
Glu Val Ser Leu Asn Ser Gly Ala Ala Val Leu Ala Gly Leu Arg Glu 
20 25 30 

ggc ggt att gac gcg tat cct gtc gac ccg aaa gaa gtc gac gtg acg 144 
Gly Gly lie Asp Ala Tyr Pro Val Asp Pro Lys Glu Val Asp Val Thr 
35 40 45 

caa ctg aag teg atg ggc ttt cag aaa gtg ttt ate gcg eta cac ggt 192 
Gin Leu Lys Ser Met Gly Phe Gin Lys Val Phe lie Ala Leu His Gly 
50 55 60 

cgc ggc ggt gaa gat ggt acg ctg cag ggg atg etc gag ctg atg ggc 240 
Arg Gly Gly Glu Asp Gly Thr Leu Gin Gly Met Leu Glu Leu Met Gly 
65 70 75 80 

ttg cct tat ace gga age gga gtg atg gca tct gcg ctt tea atg gat 288 
Leu Pro Tyr Thr Gly Ser Gly Val Met Ala Ser Ala Leu Ser Met Asp 
85 90 95 

aaa eta cgc age aaa ctt eta tgg caa ggt gcc ggt tta ccg gtc gcg 336 
Lys Leu Arg Ser Lys Leu Leu Trp Gin Gly Ala Gly Leu Pro Val Ala 
100 105 110 

ccg tgg gta gcg tta acc cgc gca gag ttt gaa aaa ggc ctg age gat 384 
Pro Trp Val Ala Leu Thr Arg Ala Glu Phe Glu Lys Gly Leu Ser Asp 
115 120 125 

aag cag tta gca gaa att tct get ctg ggt ttg ccg gtt ate gtt aag 432 
Lys Gin Leu Ala Glu lie Ser Ala Leu Gly Leu Pro Val lie Val Lys 
130 135 140 

ccg age cgc gaa ggt tec agt gtg gga atg tea aaa gta gta gca gaa 480 
Pro Ser Arg Glu Gly Ser Ser Val Gly Met Ser Lys Val Val Ala Glu 
145 ~ 150 155 160 

aat get eta caa gat gca tta aga ttg gca ttt cag cac gat gaa gaa 528 
Asn Ala Leu Gin Asp Ala Leu Arg Leu Ala Phe Gin His Asp Glu Glu 
165 170 175 

gta ttg att gaa aaa tgg eta agt ggg ccg gag ttc acg gtt gcg ata 576 
Val Leu lie Glu Lys Trp Leu Ser Gly Pro Glu Phe Thr Val Ala lie 
180 185 190 

etc ggt gaa gaa att tta ccg tea ata cgt att caa ccg tec gga acc 624 
Leu Gly Glu Glu lie Leu Pro Ser lie Arg lie Gin Pro Ser Gly Thr 
195 200 205 

ttc tat gat tat gag gcg aag tat etc tct gat gag aca cag tat ttc 672 
Phe Tyr Asp Tyr Glu Ala Lys Tyr Leu Ser Asp Glu Thr Gin Tyr Phe 
210 215 220 

tgc ccc gca ggt ctg gaa gcg tea caa gag gcc aat ttg cag gca tta 720 
Cys Pro Ala Gly Leu Glu Ala Ser Gin Glu Ala Asn Leu Gin Ala Leu 
225 230 235 240 

gtg ctg aaa gca tgg acg acg tta ggt tgc aaa gga tgg gga cgt att 768 
Val Leu Lys Ala Trp Thr Thr Leu Gly Cys Lys Gly Trp Gly Arg He 
245 250 255 
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gac gtt atg ctg gac age gat gga cag ttt tat ctg ctg gaa gec aat 816 
Asp Val Met Leu Asp Ser Asp Gly Gin Phe Tyr Leu Leu Glu Ala Asn 
260 265 270 

acc tea ccg ggt atg acc age cac age ctg gtg ccg atg gcg gca cgt 864 
Thr Ser Pro Gly Met Thr Ser His Ser Leu Val Pro Met Ala Ala Arg 
275 280 285 

cag gca ggt atg age ttc teg cag ttg.gta gta cga att ctg gaa ctg 912 
Gin Ala t Gly Met Ser Phe Ser Gin Leu Val Val Arg lie Leu Glu Leu 
290 295 300 

gcg gac taa 921 

Ala Asp * 

305 



<210> 264 
<211> 606 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (606) 

<400> 264 

atg gcg ctg cat gac gaa aac gtc gtc tgg cat age cat ccg gtc act 
Met Ala Leu His Asp Glu Asn Val Val Trp His Ser His Pro Val Thr 
15 10 15 



48 



gtg caa caa cgc gag eta cac cac ggt cat cgt ggt gta gtg ctg tgg 96 
Val Gin Gin Arg Glu Leu His His Gly His Arg Gly Val Val Leu Trp 
20 25 30 

ttt acc ggc etc tec ggg tec ggt aaa tea acg gtc gee ggg gcg ctg 144 
Phe Thr Gly Leu Ser Gly Ser Gly Lys Ser Thr Val Ala Gly Ala Leu 
35 40 45 

gag gag gcg tta cat aaa etc ggc gtc agt acg tat ctg ctg gat ggc 192 
Glu Glu Ala Leu His Lys Leu Gly Val Ser Thr Tyr Leu Leu Asp Gly 
50 55 60 

gac aat gtt cgc cac gga tta tgc age gat etc ggt ttt age gat gee 240 
Asp Asn Val Arg His Gly Leu Cys Ser Asp Leu Gly Phe Ser Asp Ala 
65 70 75 80 

gat cgt aaa gag aat ate cgt cgc gtc ggt gaa gtg gcg aat ttg atg 288 
Asp Arg Lys Glu Asn lie Arg Arg Val Gly Glu Val Ala Asn Leu Met 
85 90 95 

gtt gaa gec gga ctg gtg gtg ctg acc gca ttt ate teg cca cac cgc 336 
Val Glu Ala Gly Leu Val Val Leu Thr Ala Phe He Ser Pro His Arg 
100 105 110 

gee gaa cgc cag atg gtt cgc gaa cgc gta gga gaa ggg cgc ttt ate 384 
Ala Glu Arg Gin Met Val Arg Glu Arg Val Gly Glu Gly Arg Phe He 
115 120 125 

gaa gtg ttt gtc gat acg ccg ctg gcg att tgc gaa gee cgc gat ccc 432 
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Glu Val Phe Val Asp Thr Pro Leu Ala lie Cys Glu Ala Arg Asp Pro 
130 135 140 

aaa ggc tta tat aag aaa gcg cgt gcc ggt gaa ctg cgc aac ttt acg 480 
Lys Gly Leu Tyr Lys Lys Ala Arg Ala Gly Glu Leu Arg Asn Phe Thr 
145 150 155 160 

gga ata gat tec gtt tac gaa gcg cct gaa teg gca gaa att cat etc 528 
Gly lie Asp Ser Val Tyr Glu Ala Pro Glu Ser Ala Glu lie His Leu 
165 170 175 

aat ggt gaa caa tta gta aca aat ttg gta cag caa tta tta gat ctg 576 
Asn Gly Glu Gin Leu Val Thr Asn Leu Val Gin Gin Leu Leu Asp Leu 
180 185 190 

ttg aga cag aac gat att ate aga tec tga 606 
Leu Arg Gin Asn Asp lie lie Arg Ser * 
195 200 



<210> 265 
<211> 1428 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1428) 

<400> 265 

atg aac acc gca ctt gca caa caa ate gcc aat gaa ggc ggc gtc gaa 48 

Met Asn Thr Ala Leu Ala Gin Gin He Ala Asn Glu Gly Gly Val Glu 
1 5 10 15 

gcc tgg atg att gcg caa caa cat aaa age ctg ctg cgt ttt ctg acc 96 
Ala Trp Met He Ala Gin Gin His Lys Ser Leu Leu Arg Phe Leu Thr 
20 25 ~ 30 

tgt ggt age gtc gat gac ggc aaa agt act ctg att ggt cgt ctg ctg 144 
Cys Gly Ser Val Asp Asp Gly Lys Ser Thr Leu He Gly Arg Leu Leu 
35 40 45 

cac gat acc cgc caa ate tac gaa gat cag etc tea teg ctg cat aac 192 
His Asp Thr Arg Gin lie Tyr Glu Asp Gin Leu Ser Ser Leu His Asn 
50 55 60 

gac agt aag cgt cac ggc acc cag ggc gaa aag ctg gat ctg get ctg 240 
Asp Ser Lys Arg His Gly Thr Gin Gly Glu Lys Leu Asp Leu Ala Leu 
65 70 75 80 

ctg gtg gac ggc ctg caa get gag cgc gaa cag ggc ate acc att gac 288 
Leu Val Asp Gly Leu Gin Ala Glu Arg Glu Gin Gly He Thr He Asp 
85 90 95 

gtg gcc tac cgc tat ttc tct acc gag aag cgt aaa ttt att ate gcc 336 
Val Ala Tyr Arg Tyr Phe Ser Thr Glu Lys Arg Lys Phe He He Ala 
100 105 * 110 

gac acc cca ggg cac gag cag tac acc cgc aat atg gcg act ggc gca 384 
Asp Thr Pro Gly His Glu Gin Tyr Thr Arg Asn Met Ala Thr Gly Ala 
115 120 125 
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teg aca tgt gaa ctg gcg ate tta ctg ate gat gee cgt aaa ggc gtg 432 
Ser Thr Cys Glu Leu Ala lie Leu Leu lie Asp Ala Arg Lys Gly Val 
130 135 140 

etc gat caa acc cgt cgt cac agt ttt ate tec aca ctg ttg ggg ate 480 
Leu Asp Gin Thr Arg Arg His Ser Phe lie Ser Thr Leu Leu Gly lie 
145 150 155 160 

aaa cat ctg gtc gtg gcg ate aac aaa atg gat ctg gtg gat tac agt 528 
Lys His Leu Val Val Ala He Asn Lys Met Asp Leu Val Asp Tyr Ser 
165 170 175 

gaa gag acg ttc acc cgt att cgt gaa gat tat ttg acc ttt gec ggg 576 
Glu Glu Thr Phe Thr Arg He Arg Glu Asp Tyr Leu Thr. Phe Ala Gly 
180 185 190 

cag ctg ccg ggt aat ctg gat ate cgc ttt gtg ccg etc tct gca ctg 624 
Gin Leu Pro Gly Asn Leu Asp He Arg Phe Val Pro Leu Ser Ala Leu 
195 200 205 

gaa ggc gac aac gtg gca teg caa agt gaa agt atg ccg tgg tac age 672 
Glu Gly Asp Asn Val Ala Ser Gin Ser Glu Ser Met Pro Trp Tyr Ser 
210 215 220 

ggt ccg aca ctg etc gaa gtg ctg gaa acc gtg gag ate cag cga gtg 720 
Gly Pro Thr Leu Leu Glu Val Leu Glu Thr Val Glu He Gin Arg Val 
225 230 235 240 

gtg gat get cag cca atg cgc ttc ccg gtg cag tac gtt aat cgc ccg 768 
Val Asp Ala Gin Pro Met Arg Phe Pro Val Gin Tyr Val Asn Arg Pro 
245 250 255 

aat etc gat ttt cgt ggt tac gee gga acg ctg gca tec ggt cgc gtg 816 
Asn Leu Asp Phe Arg Gly Tyr Ala Gly Thr Leu Ala Ser Gly Arg Val 
260 265 270 

gaa gtc ggg caa cgt gtc aaa gtg ctg ccc tct ggt gtg gaa tea aac 864 
Glu Val Gly Gin Arg Val Lys Val Leu Pro Ser Gly Val Glu Ser Asn 
275 280 285 

gtc gcg egg ate gtg act ttt gat ggt gat cgc gaa gaa gec ttt gee 912 
Val Ala Arg He Val Thr Phe Asp Gly Asp Arg Glu Glu Ala Phe Ala 
290 295 300 

gga gaa gcg ate acc ctg gtg ctg acg gat gag ate gac ate age cgt 960 
Gly Glu Ala He Thr Leu Val Leu Thr Asp Glu He Asp He Ser Arg 
305 310 315 320 

ggc gat ctg ctg ctg gcg gca gac gaa gcg tta ccg gcg gtg cag age 1008 
Gly Asp Leu Leu Leu Ala Ala Asp Glu Ala Leu Pro Ala Val Gin Ser 
325 330 335 

gcg teg gtg gat gtg gta tgg atg gcg gaa cag ccg ctt tct cca ggg 1056 
Ala Ser Val Asp Val Val Trp Met Ala Glu Gin Pro Leu Ser Pro Gly 
340 345 350 

cag agt tac gac ate aaa att gec ggt aag aag acg cgc gcg cgt gtt 1104 
Gin Ser Tyr Asp He Lys He Ala Gly Lys Lys Thr Arg Ala Arg Val 
355 360 365 
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gat ggc att cgc tat cag gtt gat att aat aac ctt acc cag cgt gaa 1152 
Asp Gly lie Arg Tyr Gin Val Asp lie Asn Asn Leu Thr Gin Arg Glu 
370 375 380 

gtt gaa aac ctg cca ctg aat ggg ate ggc etc gtg gat etc act ttt 1200 
Val Glu Asn Leu Pro Leu Asn Gly lie Gly Leu Val Asp Leu Thr Phe 
385 390 395 400 

gac gag ccg ctg gtg tta gat cgt tat caa caa aat ccg gtg acg ggt 1248 
Asp Glu Pro Leu Val Leu Asp Arg Tyr Gin Gin Asn Pro Val Thr Gly 
405 410 415 

ggg ctg att ttt ate gat cgc ctg age aat gtg acc gtg ggt gee ggt 1296 
Gly Leu He Phe He Asp Arg Leu Ser Asn Val Thr Val Gly Ala Gly 
420 425 430 



atg gtg cac gag cca gtt age cag gca act get gcg cca tct gaa ttc 1344 
Met Val His Glu Pro Val Ser Gin Ala Thr Ala Ala Pro Ser Glu Phe 
435 440 445 

agt gca ttc gaa ctg gaa ttg aat get ctg gtt cgt cgc cac ttt ccg 1392 
Ser Ala Phe Glu Leu Glu Leu Asn Ala Leu Val Arg Arg His Phe Pro 
450 455 460 

cac tgg ggc gcg cgc gat ttg ctg ggg gat aaa taa 1428 
His Trp Gly Ala Arg Asp Leu Leu Gly Asp Lys * 
465 470 475 



<210> 266 
<211> 384 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (384) 

<400> 266 

atg cgc cat cgt aag agt ggt cgt caa ctg aac cgc aac age age cat 48 
Met Arg His Arg Lys Ser Gly Arg Gin Leu Asn Arg Asn Ser Ser His 
15 10 15 



cgc cag get atg ttc cgc aat atg gca ggt tea ctg gtt cgt cat gaa 96 
Arg Gin Ala Met Phe Arg Asn Met Ala Gly Ser Leu Val Arg His Glu 
20 25 30 

ate ate aag acg act ctg cct aaa gcg aaa gag ctg cgc cgc gta gtt 144 
He He Lys Thr Thr Leu Pro Lys Ala Lys Glu Leu Arg Arg Val Val 
35 40 45 

gag ccg ctg att act ctt gee aag act gat age gtt get aat cgt cgt 192 
Glu Pro Leu He Thr Leu Ala Lys Thr Asp Ser Val Ala Asn Arg Arg 
50 55 60 

ctg gca ttc gec cgt act cgt gat aac gag ate gtg gca aaa ctg ttt 240 
Leu Ala Phe Ala Arg Thr Arg Asp Asn Glu He Val Ala Lys Leu Phe 
65 70 75 80 

aac gaa ctg ggc ccg cgt ttc gcg age cgt gec ggt ggt tac act cgt 288 
Asn Glu Leu Gly Pro Arg Phe Ala Ser Arg Ala Gly Gly Tyr Thr Arg 
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85 90 95 

att ctg aag tgt ggc ttc cgt gca ggc gac aac gcg ccg atg get tac 336 
lie Leu Lys Cys Gly Phe Arg Ala Gly Asp Asn Ala Pro Met Ala Tyr 
100 105 110 

ate gag ctg gtt gat cgt tea gag aaa gca gaa get get gca gag taa 384 
lie Glu Leu Val Asp Arg Ser Glu Lys Ala Glu Ala Ala Ala Glu * 
115 120 125 



<210> 267 
<211> 990 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (990) 

<400> 267 

atg cag ggt tct gtg aca gag ttt eta aaa ccg cgc ctg gtt gat ate 48 
Met Gin Gly Ser Val Thr Glu Phe Leu Lys Pro Arg Leu Val Asp lie 
15 10 15 

gag caa gtg agt teg acg cac gee aag gtg acc ctt gag cct tta gag 96 
Glu Gin Val Ser Ser Thr His Ala Lys Val Thr Leu Glu Pro Leu Glu 
20 25 30 

cgt ggc ttt ggc cat act ctg ggt aac gca ctg cgc cgt att ctg etc 144 
Arg Gly Phe Gly His Thr lieu Gly Asn Ala Leu Arg Arg lie Leu Leu 
35 40 45 

tea teg atg ccg ggt tgc gcg gtg acc gag gtt gag att gat ggt gta 192 
Ser Ser Met Pro Gly Cys Ala Val Thr Glu Val Glu lie Asp Gly Val 
50 55 60 

eta cat gag tac age acc aaa gaa ggc gtt cag gaa gat ate ctg gaa 240 
Leu His Glu Tyr Ser Thr Lys Glu Gly Val Gin Glu Asp lie Leu Glu 
65 70 75 80 

ate ctg etc aac ctg aaa ggg ctg gcg gtg aga gtt cag ggc aaa gat 288 
lie Leu Leu Asn Leu Lys Gly Leu Ala Val Arg Val Gin Gly Lys Asp 
85 90 95 

gaa gtt att ctt acc ttg aat aaa tct ggc att ggc cct gtg act gca 336 
Glu Val lie Leu Thr Leu Asn Lys Ser Gly He Gly Pro Val Thr Ala 
100 105 110 

gee gat ate acc cac gac ggt gat gtc gaa ate gtc aag ccg cag cac 384 
Ala Asp He Thr His Asp Gly Asp Val Glu He Val Lys Pro Gin His 
115 " 120 125 

gtg ate tgc cac ctg acc gat gag aac gcg tct att age atg -cgt ate 432 
Val lie Cys His Leu Thr Asp Glu Asn Ala Ser He Ser Met Arg He 
130 135 140 

aaa gtt cag cgc ggt cgt ggt tat gtg ccg get tct acc cga att cat 480 
Lys Val Gin Arg Gly Arg Gly Tyr Val Pro Ala Ser Thr Arg He His 
145 150 155 160 
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teg gaa gaa gat gag cgc cca ate ggc cgt ctg ctg gtc gac gca tgc 528 
Ser Glu Glu Asp Glu Arg Pro lie Gly Arg Leu Leu val Asp Ala Cys 
165 170 175 

tac age cct gtg gag cgt att gee tac aat gtt gaa gca gcg cgt gta 576 
Tyr Ser Pro Val Glu Arg lie Ala Tyr Asn Val Glu Ala Ala Arg Val 
180 185 190 

gaa cag cgt acc gac ctg gac aag ctg gtc ate gaa atg gaa ace aac 624 
Glu Gin Arg Thr Asp Leu Asp Lys Leu Val lie Glu Met Glu Thr Asn 
195 200 205 

ggc aca ate gat cct gaa gag gcg att cgt cgt gcg gca acc att ctg 672 
Gly Thr He Asp Pro Glu Glu Ala He Arg Arg Ala Ala Thr He Leu 
210 215 220 

get gaa caa ctg gaa get ttc gtt gac tta cgt gat gta cgt cag cct 720 
Ala Glu Gin Leu Glu Ala Phe Val Asp Leu Arg Asp Val Arg Gin Pro 
225 230 235 240 

gaa gtg aaa gaa gag aaa cca gag ttc gat ccg ate ctg ctg cgc cct 768 
Glu Val Lys Glu Glu Lys Pro Glu Phe Asp Pro He Leu Leu Arg Pro 
245 250 255 

gtt gac gat ctg gaa ttg act gtc cgc tct get aac tgc ctt aaa gca 816 
Val Asp Asp Leu Glu Leu Thr Val Arg Ser Ala Asn Cys Leu Lys Ala 
260 265 270 

gaa get ate cac tat ate ggt gat ctg gta cag cgt acc gag gtt gag 864 
Glu Ala He His Tyr He Gly Asp Leu Val Gin Arg Thr Glu Val Glu 
275 280 285 

etc ctt aaa acg cct aac ctt ggt aaa aaa tct ctt act gag att aaa 912 
Leu Leu Lys Thr Pro Asn Leu Gly Lys Lys Ser Leu Thr Glu He Lys 
290 295 300 

gac gtg ctg get tec cgt gga ctg tct ctg ggc atg cgc ctg gaa aac 960 
Asp Val Leu Ala Ser Arg Gly Leu Ser Leu Gly Met Arg Leu Glu Asn 
305 310 315 320 

tgg cca ccg gca age ate get gac gag taa 990 
Trp Pro Pro Ala Ser He Ala Asp Glu * 
325 



<210> 268 
<211> 621 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (621) 

<400> 268 

atg gca aga tat ttg ggt cct aag 
Met Ala Arg Tyr Leu Gly Pro Lys 
1 5 

acc gac tta ttc ctt aag tct ggc 



etc aag ctg age cgt cgt gag ggc 48 
Leu Lys Leu Ser Arg Arg Glu Gly 
10 15 

gtt cgc gcg ate gat acc aag tgt 96 
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Thr Asp Leu Phe Leu Lya Ser Gly Val Arg Ala lie Asp Thr Lys Cys 
20 25 30 

aaa att gaa caa get cct ggc cag cac ggt gcg cgt aaa ccg cgt ctg 144 
Lys He Glu Gin Ala Pro Gly Gin His Gly Ala Arg Lys Pro Arg lieu 
35 40 45 

tct gac tat ggt gtg cag ttg cgt gaa aag caa aaa gtt cgc cgt ate 192 
Ser Asp Tyr Gly Val Gin Leu Arg Glu Lys Gin Lys Val Arg Arg He 
50 55 60 

tat ggt gtg ctg gag cgt cag ttc cgt aac tac tac aaa gaa gca gca 240 
Tyr Gly Val Leu Glu Arg Gin Phe Arg Asn Tyr Tyr Lys Glu Ala Ala 
65 70 75 80 

cgt ctg aaa ggc aac acc ggt gaa aac ctg ttg get ctg ctg gaa ggt 288 
Arg Leu Lys Gly Asn Thr Gly Glu Asn Leu Leu Ala Leu Leu Glu Gly 
85 90 95 

cgt ctg gac aac gtt gta tac cgt atg ggc ttc ggt gec act cgt gca 336 
Arg Leu Asp Asn Val Val Tyr Arg Met Gly Phe Gly Ala Thr Arg Ala 
100 105 110 

gaa gca cgt cag ctg gtt age cat aaa gca att atg gta aac ggt cgt 384 
Glu Ala Arg Gin Leu Val Ser His Lys Ala He Met Val Asn Gly Arg 
115 120 125 

gtt gtt aac ate get tct tat cag gtt agt ccg aat gac gtt gta age 432 
Val Val Asn He Ala Ser Tyr Gin Val Ser Pro Asn Asp Val Val Ser 
130 135 140 

att cgt gag aaa gcg aag aag cag tct cgc gtg aaa gee get ctg gag 480 
He Arg Glu Lys Ala Lys Lys Gin Ser Arg Val Lys Ala Ala Leu Glu 
145 150 155 160 

ctg get gag cag cgt gaa aag cca acc tgg ctg gaa gtt gat get ggc 528 
Leu Ala Glu Gin Arg Glu Lys Pro Thr Trp Leu Glu Val Asp Ala Gly 
165 170 175 

aag atg gaa ggt acg ttt aag cgt aag ccg gag cgt tct gat ctg tct 576 
Lys Met Glu Gly Thr Phe Lys Arg Lys Pro Glu Arg Ser Asp Leu Ser 
180 185 190 

gcg gac att aac gaa cac ctg ate gtc gag ctt tac tec aag taa 621 
Ala Asp He Asn Glu His Leu He Val Glu Leu Tyr Ser Lys * 
195 200 205 



<210> 269 
<211> 390 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) ...(390) 

<400> 269 

atg gca aag gca cca att cgt gca cgt aaa cgt gta aga aaa caa gtc 48 
Met Ala Lys Ala Pro He Arg Ala Arg Lys Arg Val Arg Lys Gin Val 
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15 10 15 

tct gac ggc gtg get cat ate cat get tct ttc aac aac ace ate gtg 96 
Ser Asp Gly Val Ala His lie His Ala Ser Phe Asn Asn Thr He Val 
20 25 30 

act ate act gat cgt cag ggt aac gcg ttg ggt tgg gca aca gee ggt 144 
Thr He Thr Asp Arg Gin Gly Asn Ala Leu Gly Trp Ala Thr Ala Gly 
35 40 45 

ggt tec ggt ttc cgt ggt tct cgc aaa tec act ccg ttt gca get cag 192 
Gly Ser Gly Phe Arg Gly Ser Arg Lys Ser Thr Pro Phe Ala Ala Gin 
50 55 60 

gtt gca gca gag cgt tgc get gac gee gtg aaa gaa tac ggc ate aag 240 
Val Ala Ala Glu Arg Cys Ala Asp Ala Val Lys Glu Tyr Gly lie Lys 
65 70 75 80 

aat ctg gaa gtt atg gtt aaa ggt ccg ggt cca ggc cgc gaa tct act 288 
Asn Leu Glu Val Met Val Lys Gly Pro Gly Pro Gly Arg Glu Ser Thr 
85 90 95 

att cgt get ctg aac gee gca ggt ttc cgc ate act aac att act gat 336 
He Arg Ala Leu Asn Ala Ala Gly Phe Arg He Thr Asn He Thr Asp 
100 105 110 

gtg act ccg ate cct cat aac ggt tgt cgt ccg ccg aaa aaa cgt cgc 384 
Val Thr Pro He Pro His Asn Gly Cys Arg Pro Pro Lys Lys Arg Arg 
115 120 125 

gta taa 390 
Val * 



<210> 270 
<211> 357 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . - . (357) 

<400> 270 

9tg gee cgt ata gca ggc att aac att cct gat cat aag cat gee gta 48 
Met Ala Arg He Ala Gly He Asn He Pro Asp His Lys His Ala Val 
15 10 15 

ate gca tta act teg att tat ggc gtc ggc aag ace cgt tct aaa gec 96 
He Ala Leu Thr Ser He Tyr Gly Val Gly Lys Thr Arg Ser Lys Ala 
20 25 30 

ate ctg get gca gcg ggt ate get gaa gat gtt aag ate agt gag ctg 144 
He Leu Ala Ala Ala Gly He Ala Glu Asp Val Lys He Ser Glu Leu 
35 40 45 

tct gaa gga caa ate gac acg ctg cgt gac gaa gtt gec aaa ttt gtc 192 
Ser Glu Gly Gin He Asp Thr Leu Arg Asp Glu Val Ala Lys Phe Val 
50 55 60 
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gtt gaa ggt gat ctg cgc cgt gaa ate age atg age ate aag cgc ctg 240 
Val Glu Gly Asp Leu Arg Arg Glu lie Ser Met Ser lie Lys Arg Leu 
65 70 75 80 

atg gat ctt ggt tgc tat cgc ggt ttg cgt cat cgt cgt ggt etc ccg 288 
Met Asp Leu Gly Cys Tyr Arg Gly Leu Arg His Arg Arg Gly Leu Pro 
85 90 95 

gtt cgc ggt cag cgt acc aag ace aac gca cgt acc cgt aag ggt ccg 336 
Val Arg Gly Gin Arg Thr Lys Thr Asn Ala Arg Thr Arg Lys Gly Pro 
100 105 110 

cgc aaa ccg ate aag aaa taa 357 
Arg Lys Pro He Lys Lys * 
115 



<210> 271 
<211> 1383 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1383) 

<400> 271 

atg acg caa tta acc atg aaa gac aaa att ggc tac ggg ctg gga gac 48 
Met Thr Gin Leu Thr Met Lys Asp Lys He Gly Tyr Gly Leu Gly Asp 
15 10 15 

acc gee tgc ggc ttc gtc tgg cag gee acg atg ttc ctg ctg gee tat 96 
Thr Ala Cys Gly Phe Val Trp Gin Ala Thr Met Phe Leu Leu Ala Tyr 
20 25 30 

ttc tac acc gac gtc ttc ggc ctg teg gcg ggg att atg ggc acg ctg 144 
Phe Tyr Thr Asp Val Phe Gly Leu Ser Ala Gly He Met Gly Thr Leu 
35 40 45 

ttt ttg gtc tec cgc gtg etc gac gee gtc acc gac ccg ctg atg ggg 192 
Phe Leu Val Ser Arg Val Leu Asp Ala Val Thr Asp Pro Leu Met Gly 
50 55 60 

ctg ctg gta gac cgc acc cgc acg egg cac ggc cag ttc cgc ccg ttc 240 
Leu Leu Val Asp Arg Thr Arg Thr Arg His Gly Gin Phe Arg Pro Phe 
65 70 75 80 

ctg ctg tgg ggg gee ate ccg ttc ggc ate gtc tgc gtg ctg acc ttc 288 
Leu Leu Trp Gly Ala He Pro Phe Gly He Val Cys Val Leu Thr Phe 
85 90 95 

tac acg ccg gac ttc tec gca cag ggc aag ate ate tac gee tgc gtg 336 
Tyr Thr Pro Asp Phe Ser Ala Gin Gly Lys He He Tyr Ala Cys Val 
100 105 110 

acc tac att etc ctg acc ctg gtc tac acc ttc gtt aac gtg ccg tac 384 
Thr Tyr He Leu Leu Thr Leu Val Tyr Thr Phe Val Asn Val Pro Tyr 
115 120 125 

tgc gee atg ccg ggc gtc ate acc gee gac ccg aaa gag cgt cac gee 432 
Cys Ala Met Pro Gly Val He Thr Ala Asp Pro Lys Glu Arg His Ala 
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130 135 140 

ctg cag tec tgg cgc ttc ttc ctg gcg gcg gcg ggc teg etc get ate 480 
Leu Gin Ser Trp Arg Phe Phe Leu Ala Ala Ala Gly Ser Leu Ala lie 
145 150 155 160 

age ggc ate gcg ctg ccg ctg gtg age ate ate ggc aaa ggg gac gag 528 
Ser Gly lie Ala Leu Pro Leu Val Ser lie lie Gly Lys Gly Asp Glu 
165 170 175 

cag gtg ggc tac ttc ggc gee atg tgc gtg ctg ggg ctg age ggc gtg 576 
Gin Val Gly Tyr Phe Gly Ala Met Cys Val Leu Gly Leu Ser Gly Val 
180 185 190 

gtg ctg etc tac gtc tgc ttc ttc acg acc aaa gag cgc tac ace ttt 624 
Val Leu Leu Tyr Val Cys Phe Phe Thr Thr Lys Glu Arg Tyr Thr Phe 
195 200 205 

gag gtg cag ccg ggc teg teg gtg gcg aaa gac ctt aag ctg ctg ctg 672 
Glu Val Gin Pro Gly Ser Ser Val Ala Lys Asp Leu Lys Leu Leu Leu 
210 215 220 

ggc aac age cag tgg cgc ate atg tgc gcg ttc aag atg atg gcg acc 720 
Gly Asn Ser Gin Trp Arg lie Met Cys Ala Phe Lys Met Met Ala Thr 
225 230 235 240 

tgc tec aac gtg gtg cgc ggc ggg gcg acg etc tac ttc gtg aaa tac 768 
Cys Ser Asn Val Val Arg Gly Gly Ala Thr Leu Tyr Phe Val Lys Tyr 
245 250 255 

gtg atg gat cac ccg gag ttg gcg acc cag ttt tta ctt tac ggc age 816 
Val Met Asp His Pro Glu Leu Ala Thr Gin Phe Leu Leu Tyr Gly Ser 
260 265 270 

etc gee acc atg ttc ggc teg ctt tgc tec tea cgc ctg ctg ggc cgc 864 
Leu Ala Thr Met Phe Gly Ser Leu Cys Ser Ser Arg Leu Leu Gly Arg 
275 280 285 

ttc gac cgc gtc acc gee ttc aag tgg ate ate gtc gee tac teg ctg 912 
Phe Asp Arg Val Thr Ala Phe Lys Trp lie lie Val Ala Tyr Ser Leu 
290 295 300 

ate age ctg ctg att ttc gtc acc ccg gcg gag cac ate gcg etc att 960 
lie Ser Leu Leu lie Phe Val Thr Pro Ala Glu His lie Ala Leu lie 
305 310 315 320 

ttt gee etc aac ate ctg ttc ctg ttc gtc ttt aat acc acc acg ccg 1008 
Phe Ala Leu Asn lie Leu Phe Leu Phe Val Phe Asn Thr Thr Thr Pro 
325 330 335 

ctg cag tgg ctg atg get tct gac gtg gtg gac tac gag gag age cgc 1056 
Leu Gin Trp Leu Met Ala Ser Asp Val Val Asp Tyr Glu Glu Ser Arg 
340 345 350 

a gc ggt cgc cgc etc gac ggg ctg gtg ttc tec acc tac ctg ttc age 1104 
Ser Gly Arg Arg Leu Asp Gly Leu Val Phe Ser Thr Tyr Leu Phe Ser 
355 360 365 

ctg aag att ggc ctg gcg att ggc ggg gcg gtg gtg ggc tgg ate ctg 1152 
Leu Lys lie Gly Leu Ala lie Gly Gly Ala Val Val Gly Trp He Leu 
370 375 380 
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gcg tac gtc aac tat tec gec age age age gtg cag ccg gtt gag gtg 1200 
Ala Tyr Val Asn Tyr Ser Ala Ser Ser Ser Val Gin Pro Val Glu Val 
385 390 395 400 

etc acc ace ate aaa att ctg ttc tgc gtg gtg ccg gtg gtg etc tac 1248 
Leu Thr Thr He Lys He Leu Phe Cys Val Val Pro Val Val Leu Tyr 
405 410 415 

gcg ggc atg ttc ate atg ctg teg etc tac aag etc acc gat gec cgc 1296 
Ala Gly Met Phe He Met Leu Ser Leu Tyr Lys Leu Thr Asp Ala Arg 
420 425 430 

gtg gag gec ate age egg cag ctg att aag cac cgc gcg gcg cag ggc 1344 
Val Glu Ala He Ser Arg Gin Leu He Lys His Arg Ala Ala Gin Gly 
435 440 445 

gag gee gtt ccc gac gee gcg aca gee gca tec cat taa 1383 
Glu Ala Val Pro Asp Ala Ala Thr Ala Ala Ser His * 
450 455 460 



<210> 272 
<211> 1611 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (1) . . . (1611) 



<400> 272 

atg gaa ate act aac ccg ata etc acc ggc ttc aac ccg gac ccg tec 48 
Met Glu He Thr Asn Pro He Leu Thr Gly Phe Asn Pro Asp Pro Ser 
15 10 15 

ctg tgc cgc cag ggc gag gac tac tac ate gec acc teg acc ttc gag 96 
Leu Cys Arg Gin Gly Glu Asp Tyr Tyr He Ala Thr Ser Thr Phe Glu 
20 25 30 

tgg ttc ccg ggc gtg cgc ate tac cac tec cgt gac ctg aaa aac tgg 144 
Trp Phe Pro Gly Val Arg He Tyr His Ser Arg Asp Leu Lys Asn Trp 
35 40 45 

teg ctg gtc age acc ccg ttg gac cgc gtg teg atg ctg gac atg aag 192 
Ser Leu Val Ser Thr Pro Leu Asp Arg Val Ser Met Leu Asp Met Lys 
50 55 60 

ggc aac ccg gac tec ggc ggc ate tgg gcg ccg tgc ctg age tac gee 240 
Gly Asn Pro Asp Ser Gly Gly He Trp Ala Pro Cys Leu Ser Tyr Ala 
65 70 75 B0 

gac ggt aaa ttc tgg ctg etc tac acc gac gtg aag att gtc gac teg 288 
Asp Gly Lys Phe Trp Leu Leu Tyr Thr Asp Val Lys He Val Asp Ser 
85 90 95 

ccg tgg aaa aac ggc cgc aac ttc etc gtc acc gcg ccc tec ate gag 336 
Pro Trp Lys Asn Gly Arg Asn Phe Leu Val Thr Ala Pro Ser He Glu 
100 105 110 

ggg cca tgg age gag cca ate ccg atg ggc aac ggc ggg ttt gac ccg 384 
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Gly Pro Trp Ser Glu Pro lie Pro Met Gly Asn Gly Gly Phe Asp Pro 
115 120 125 

tec ctg ttc cac gac gac gat ggc cgc aaa tac tat ate tac cgc ccg 432 
Ser Leu Phe His Asp Asp Asp Gly Arg Lys Tyr Tyr lie Tyr Arg Pro 
130 135 140 

tgg ggg ccg cgc cac cac age aac ccg cac aac acc ate gtg tta cag 480 
Trp Gly Pro Arg His His Ser Asn Pro His Asn Thr lie Val Leu Gin : 
145 150 155 160 

gcg ttt gac ccg cag acc ggc acg etc teg ccc gag cgc aaa acg ctg 528 
Ala Phe Asp Pro Gin Thr Gly Thr Leu Ser Pro Glu Arg Lys Thr Leu 
165 170 175 

ttt acc ggc acg ccg etc tgc tac acc gaa ggc gcg cac ctg tat cgc 576 
Phe Thr Gly Thr Pro Leu Cys Tyr Thr Glu Gly Ala His Leu Tyr Arg 
180 185 190 

cac gcg gga tgg tac tac ctg atg gec gec gag ggc ggc acc age tac 624 
His Ala Gly Trp Tyr Tyr Leu Met Ala Ala Glu Gly Gly Thr Ser Tyr 
195 200 205 

gag cac gec gtc gtg gtg ctg cgt tec aaa aat ate gac ggg ccg tac 672 
Glu His Ala Val Val Val Leu Arg Ser Lys Asn lie Asp Gly Pro Tyr 
210 215 220 

gag ctg cac ccg gac gta acg atg atg acc age tgg cac ctg ccg gag 720 
Glu Leu His Pro Asp Val Thr Met Met Thr Ser Trp His Leu Pro Glu 
225 230 235 240 

aac ccg ctg cag aag age ggc cac ggc teg ctg ctg cag acg cat acg 768 
Asn Pro Leu Gin Lys Ser Gly His Gly Ser Leu Leu Gin Thr His Thr 
245 250 255 

ggt gaa tgg tac atg gee tac etc acc age cgc ccg ctg cgc ctg ccc 816 
Gly Glu Trp Tyr Met Ala Tyr Leu Thr Ser Arg Pro Leu Arg Leu Pro 
260 265 270 

ggc gtg ccg ctg ctg gee tec ggc gga cgc ggc tac tgc ccg ctg ggg 864 
Gly Val Pro Leu Leu Ala Ser Gly Gly Arg Gly Tyr Cys Pro Leu Gly 
275 280 285 

cgc gag acc ggc ate gee cgc att gaa tgg cgc gac ggc tgg ccg tac 912 
Arg Glu Thr Gly lie Ala Arg lie Glu Trp Arg Asp Gly Trp Pro Tyr 
290 295 300 

gtg gaa ggc ggc aag cac gcg cag ctg acc gtg aaa ggc ccg caa gta 960 
Val Glu Gly Gly Lys His Ala Gin Leu Thr Val Lys Gly Pro Gin Val 
305 310 315 320 

gec gag cag cct gca gec gtt ccg ggc aac tgg egg gac gat ttc gac 1008 
Ala Glu Gin Pro Ala Ala Val Pro Gly Asn Trp Arg Asp Asp Phe Asp 
325 330 335 

gec agt teg ctt gac ccg gag ctg cag acc ctg cgc att ccg ttc gac 1056 
Ala Ser Ser Leu Asp Pro Glu Leu Gin Thr Leu Arg lie Pro Phe Asp 
340 345 350 

gac acc etc ggc teg etc acc gcg cgc ccg ggc ttc tta egg etc tat 1104 
Asp Thr Leu Gly Ser Leu Thr Ala Arg Pro Gly Phe Leu Arg Leu Tyr 
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355 360 365 

ggc aac gac teg etc aat teg ace ttc acc caa teg ace gtg gcg cgc 1152 
Gly Asn Asp Ser Leu Asn Ser Thr Phe Thr Gin Ser Thr Val Ala Arg 
370 375 380 

cgc tgg cag cac ttc gec ttc egg gca gaa acg egg atg gag ttc teg 1200 
Arg Trp Girt His Phe Ala Phe Arg Ala Glu Thr Arg Met Glu Phe Ser 
385 390 395 400 

ccg gtg cac ttc cag cag age gcg ggg ctg acc tgc tac tac aac age 1248 
Pro Val His Phe Gin Gin Ser Ala Gly Leu Thr Cys Tyr Tyr Asn Ser 
405 410 415 

aaa aac tgg age tac tgc ttt gtg gac tac gag gag gga cag ggt aga 1296 
Lys Asn Trp Ser Tyr Cys Phe Val Asp Tyr Glu Glu Gly Gin Gly Arg 
420 425 430 

acc ate aaa gtt ate cag etc gac cac aac gtg ccg teg tgg ccg ctg 1344 
Thr lie Lys Val lie Gin Leu Asp His Asn Val Pro Ser Trp Pro Leu 
435 440 445 

cac gag cag ccc att ccg gtg ccg gaa cat gcg gag age gtc tgg ctg 1392 
His Glu Gin Pro lie Pro Val Pro Glu His Ala Glu Ser Val Trp Leu 
450 455 460 

egg gtg g ac gtg g at acg ctg gtc tac cgc tac age tac teg ttt gat 1440 
Arg Val Asp Val Asp Thr Leu Val Tyr Arg Tyr Ser Tyr Ser Phe Asp 
465 470 475 480 

ggc gag acg tgg cac acc gtg ccg gtg acg tat gag gcg tgg aag ctg 1488 
Gly Glu Thr Trp His Thr Val Pro Val Thr Tyr Glu Ala Trp Lys Leu 
485 490 495 

teg gac gac tac ate ggc ggg cgc ggc ttc ttc acc ggc gcg ttt gtg 1536 
Ser Asp Asp Tyr lie Gly Gly Arg Gly Phe Phe Thr Gly Ala Phe Val 
500 505 510 

ggc ctg cac tgc gag gac ate age ggc gac ggc tgc tac gcg gac ttc 1584 
Gly Leu His Cys Glu Asp lie Ser Gly Asp Gly Cys Tyr Ala Asp Phe 
515 520 525 

gac tac ttc acc tac gag ccg gtc taa 1611 
Asp Tyr Phe Thr Tyr Glu Pro Val * 
530 535 



<210> 273 
<211> 978 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (978) 

<400> 273 

ttg cat atg aaa aaa ata ate ttt get ttt att ata tta ttt gtg ttt 48 
Met His Met Lys Lys He He Phe Ala Phe He He Leu Phe Val Phe 
1 5 10 15 
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tta etc cct atg att att ttt tac caa ccc tgg gtc aat gca tta ccg 96 
Leu Leu Pro Met lie He Phe Tyr Gin Pro Trp Val Asn Ala Leu Pro 
20 25 30 

tea acg ccc cga cac gca agt cct gag caa tta gaa aaa aca gtt cgt 144 
Ser Thr Pro Arg His Ala Ser Pro Glu Gin Leu Glu Lys Thr Val Arg 
35 40 45 

tat ctt aca caa act gtg cat cca cgt agt gec gac aat att gat aat 192 
Tyr Leu Thr Gin Thr Val His Pro Arg Ser Ala Asp Asn He Asp Asn 
50 55 60 

ctg aat agg tec gca gaa tat ata aaa gag gtc ttt gtc agt age ggt 240 
Leu Asn Arg Ser Ala Glu Tyr He Lys Glu Val Phe Val Ser Ser Gly 
65 70 75 80 

gec aga gtt ace teg cag gac gtc ccc att acg ggt ggc ccc tac aaa 288 
Ala Arg Val Thr Ser Gin Asp Val Pro He Thr Gly Gly Pro Tyr Lys 
85 90 95 

aac att gtt get gat tat ggt cct gee gat gga ccg ctg att att att 336 
Asn He Val Ala Asp Tyr Gly Pro Ala Asp Gly Pro Leu He He He 
100 105 110 

ggt gcg cat tat gac tct gec age agt tat gaa aac gat caa ttg ace 384 
Gly Ala His Tyr Asp Ser Ala Ser Ser Tyr Glu Asn Asp Gin Leu Thr 
115 120 125 

tat acg ccg ggc gcg gat gat aac gee age ggt gtg gca gga tta etc 432 
Tyr Thr Pro Gly Ala Asp Asp Asn Ala Ser Gly Val Ala Gly Leu Leu 
130 135 140 

gaa ctg gca cgt ttg tta cat cag caa gta ccg aaa aca ggc gtg cag 480 
Glu Leu Ala Arg Leu Leu His Gin Gin Val Pro Lys Thr Gly Val Gin 
145 150 155 160 

ctg gtc gee tat gcg teg gaa gaa ccg ccc ttc ttt cgt age gat gaa 528 
Leu Val Ala Tyr Ala Ser Glu Glu Pro Pro Phe Phe Arg Ser Asp Glu 
165 170 175 

atg ggg age gcg gtg cat gca get teg ctt gag cgt cca gtg aaa tta 576 
Met Gly Ser Ala Val His Ala Ala Ser Leu Glu Arg Pro Val Lys Leu 
180 185 190 

atg ata gca ctg gag atg att ggc tat tac gac tct gcg cct gga age 624 
Met He Ala Leu Glu Met He Gly Tyr Tyr Asp Ser Ala Pro Gly Ser 
195 200 205 

cag aat tac cct tac ccg gca atg tec tgg ctt tat ccc gat egg gga 672 
Gin Asn Tyr Pro Tyr Pro Ala Met Ser Trp Leu Tyr Pro Asp Arg Gly 
210 215 220 

gac ttt att gec gtg gtc ggc aga ata cag gat ate aac gee gtt cgt 720 
Asp Phe He Ala Val Val Gly Arg He Gin Asp He Asn Ala Val Arg 
225 230 235 240 

cag gta aaa gcg gca ttg ttg tea tct cag gat tta tct gtt tat tct 768 
Gin Val Lys Ala Ala Leu Leu Ser Ser Gin Asp Leu Ser Val Tyr Ser 
245 250 255 

atg aat acc cca ggg ttt att ccc ggt att gat ttc tct gac cac ctg 816 
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Met Asn Thr Pro Gly Phe lie Pro Gly lie Asp Phe Ser Asp His Leu 
260 265 270 

aat tat tgg caa cac gat att ccc gcc ata atg att act gac acc get 864 
Asn Tyr Trp Gin His Asp lie Pro Ala lie Met lie Thr Asp Thr Ala 
275 280 285 

ttt tat cgt aat aaa caa tac cac ttg ccc ggt gat acc gca gac aga 912 
Phe Tyr Arg Asn Lys Gin Tyr His Leu Pro Gly Asp Thr Ala Asp Arg 
290 295 300 

ttg aat tat cag aaa atg get cag gta gtg gat ggt gtt ata act ttg 960 
Leu Asn Tyr Gin Lys Met Ala Gin Val Val Asp Gly Val lie Thr Leu 
305 310 315 320 

tta tac aac agt aaa taa 978 
Leu Tyr Asn Ser Lys * 
325 



<210> 274 
<211> 963 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (963) 

<400> 274 

gtg atg ate aaa acg cgt ttt tct cgc tgg eta acg ttt ttt acg ttc 48 
Met Met lie Lys Thr Arg Phe Ser Arg Trp Leu Thr Phe Phe Thr Phe 
1 5 10 15 

gcc get gcc gtg gcg ctg gcg eta ccg gca aaa gcc aac acc tgg ccg 96 
Ala Ala Ala Val Ala Leu Ala Leu Pro Ala Lys Ala Asn Thr Trp Pro 
20 25 30 

ctg ccg cca gcg ggc agt cgt ctg gtt ggc gaa aac aaa ttt cat gtg 144 
Leu Pro Pro Ala Gly Ser Arg Leu Val Gly Glu Asn Lys Phe His Val 
35 40 45 

gtg gaa aat gac ggt: ggt tct ctg gaa gcc ate gcc aaa aaa tac aac 192 
Val Glu Asn Asp Gly Gly Ser Leu Glu Ala lie Ala Lys Lys Tyr Asn 
50 55 60 

gtc ggc ttt etc get ctg tta cag get aac ccc ggc gtt gat cct tac 240 
Val Gly Phe Leu Ala Leu Leu Gin Ala Asn Pro Gly Val Asp Pro Tyr 
65 70 75 80 

gta ccg cgc gcg ggc age gtg tta acg ate ccg ttg caa acc eta ctt 288 
Val Pro Arg Ala Gly Ser Val Leu Thr lie Pro Leu Gin Thr Leu Leu 
85 90 95 

cca gat gcg ccg cgc gaa ggc att gtg ate aac att gcg gag ctg cgt 336 
Pro Asp Ala Pro Arg Glu Gly lie Val lie Asn lie Ala Glu Leu Arg 
100 105 110 

etc tat tac tac ccg ccg ggt aaa aat teg gta acc gtg tat cca ata 384 
Leu Tyr Tyr Tyr Pro Pro Gly Lys Asn Ser Val Thr Val Tyr Pro He 
115 120 125 
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ggt att ggt cag tta ggt ggt gac acg ctg aca ccg aca atg gtg acc 432 
Gly lie Gly Gin Leu Gly Gly Asp Thr Leu Thr Pro Thr Met Val Thr 
130 135 140 

acc gtt tea gac aaa cgt gca aac cca acc tgg acg cca acg gca aac 480 
Thr Val Ser Asp Lys Arg Ala Asn Pro Thr Trp Thr Pro Thr Ala Asn 
145 150 155 160 

ate cgc gec cgt tat aaa gca cag gga att gag ttg cct gcg gta gtg 528 
lie Arg Ala Arg Tyr Lys Ala Gin Gly lie Glu Leu Pro Ala Val Val 
165 170 175 

ccg get gga ctg gat aac cca atg ggc cat cat gcg att cgt ctg gcg 576 
Pro Ala Gly Leu Asp Asn Pro Met Gly His His Ala lie Arg Leu Ala 
180 185 190 

gec tat ggc ggc gtt tat ttg ctt cat ggt acg aac gec gat ttc ggc 624 
Ala Tyr Gly Gly Val Tyr Leu Leu His Gly Thr Asn Ala Asp Phe Gly 
195 200 205 

att ggc atg egg gta agt tct ggc tgt att cgt ctg egg gat gac gat 672 
lie Gly Met Arg Val Ser Ser Gly Cys He Arg Leu Arg Asp Asp Asp 
210 215 220 

ate aaa aca etc ttt age cag gtc acc cca ggc acc aaa gtg aat ate 720 
He Lys Thr Leu Phe Ser Gin Val Thr Pro Gly Thr Lys Val Asn He 
225 230 235 240 

ate aac act ccg ata aaa gtc tct gec gaa cca aac ggt gcg cgt ctg 768 
He Asn Thr Pro He Lys Val Ser Ala Glu Pro Asn Gly Ala Arg Leu 
245 250 255 

gtt gaa gta cat cag ccg ctg tea gag aag att gat gac gat ccg cag 816 
Val Glu Val His Gin Pro Leu Ser Glu Lys He Asp Asp Asp Pro Gin 
260 265 270 

ctg ctg cca att acg ctg aat age gca atg caa tea ttt aaa gat gca 864 
Leu Leu Pro He Thr Leu Asn Ser Ala Met Gin Ser Phe Lys Asp Ala 
275 280 285 

gca caa act gac get gaa gtg atg caa cat gtg atg gat gtc cgt tec 912 
Ala Gin Thr Asp Ala Glu Val Met Gin His Val Met Asp Val Arg Ser 
290 295 300 

ggg atg ccg gtg gat gtc cgc cgt cat caa gtg age cca caa acg ctg 960 
Gly Met Pro Val Asp Val Arg Arg His Gin Val Ser Pro Gin Thr Leu 
305 310 315 320 

taa 963 



<210> 275 
<211> 1485 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
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<222> (1) . (1485) 
<400> 275 

gtg gtc gca ate cat ctt ttg ccg gtt agt tac aat tct gcg aca tec 48 
Met Val Ala He His Leu Leu Pro Val Ser Tyr Asn Ser Ala Thr Ser 
15 10 15 

ace gtg aat ate agt get aga ate ata ccc ctg ttg att att cac caa 96 
Thr Val Asn lie Ser Ala Arg He He Pro Leu Leu He He His Gin 
20 25 30 

aga tat aaa att cct atg cca aaa gtt cag gee gac ggc ctg cca ttg 144 
Arg Tyr Lys He Pro Met Pro Lys Val Gin Ala Asp Gly Leu Pro Leu 
35 40 45 

ccc cag cga tac ggt gcg ata tta acc att gtg att ggt att teg atg 192 
Pro Gin Arg Tyr Gly Ala He Leu Thr He Val He Gly He Ser Met 
50 55 60 

gee gtc ctt gac ggc gca ate gee aac gtc gec ctg cca aca ate gee 240 
Ala Val Leu Asp Gly Ala He Ala Asn Val Ala Leu Pro Thr He Ala 
65 70 75 80 

acg gac ctt cat gee acg cca gee agt tec ate tgg gta gtg aac gee 288 
Thr Asp Leu His Ala Thr Pro Ala Ser Ser He Trp Val Val Asn Ala 
85 90 95 

tat caa ate gee att gtc ate tec ctg etc teg ttt teg ttt ctg ggc 336 
Tyr Gin He Ala He Val He Ser Leu Leu Ser Phe Ser Phe Leu Gly 
100 105 110 

gat atg ttt ggc tat cga cgt att tat aaa tgc ggt ctg gtc gtt ttt 384 
Asp Met Phe Gly Tyr Arg Arg He Tyr Lys Cys Gly Leu Val Val Phe 
115 120 125 

ctg ttg tct tea ctg ttc tgc gee ctt tct gat teg ctg caa atg etc 432 
Leu Leu Ser Ser Leu Phe Cys Ala Leu Ser Asp Ser Leu Gin Met Leu 
130 135 140 

acc ctt gcg cgt gtc ata caa ggt ttc ggc ggt gca gcg ttg atg age 480 
Thr Leu Ala Arg Val He Gin Gly Phe Gly Gly Ala Ala Leu Met Ser 
145 *" 150 155 160 

gtt aat acc gca ctt ate cgc ctg ate tat cca caa cgt ttt ctg ggt 528 
Val Asn Thr Ala Leu He Arg Leu He Tyr Pro Gin Arg Phe Leu Gly 
165 170 175 

aga ggg atg ggc ata aac teg ttt att gtt gee gtc tct tct get gee 576 
Arg Gly Met Gly He Asn Ser Phe He Val Ala Val Ser Ser Ala Ala 
180 185 190 

ggg ccg aca att get gca gca ate etc tec ate gca tec tgg aaa tgg 624 
Gly Pro Thr He Ala Ala Ala He Leu Ser He Ala Ser Trp Lys Trp 
195 200 205 

tta ttt tta ate aac gta ccg tta ggt att ate gec ctg ctt ctg gcg 672 
Leu Phe Leu He Asn Val Pro Leu Gly He He Ala Leu Leu Leu Ala 
210 215 220 

atg cgt ttt ctg cca ccc aat ggt tct cgc gec agt aaa ccc cgt ttc 720 
Met Arg Phe Leu Pro Pro Asn Gly Ser Arg Ala Ser Lys Pro Arg Phe 
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225 230 235 240 

gac ctg ccc age gec gtg atg aac gcg tta acc ttc ggc ctg ctt ate 768 
Asp I*eu Pro Ser Ala Val Met Asn Ala Leu Thr Phe Gly Leu Leu lie 
245 250 255 

act gcg ttg agt ggt ttc get cag ggg caa teg ctg acg tta att get 816 
Thr Ala Leu Ser Gly Phe Ala Gin Gly Gin Ser Leu Thr Leu lie Ala 
260 265 270 

gcg gaa ctg gtg gta atg gtt gtt gtt ggt att ttc ttt att cgc cgc 864 
Ala Glu Leu Val Val Met Val Val Val Gly lie Phe Phe lie Arg Arg 
275 280 285 

cag ctt tct ctt ccc gta ccg ctg eta ccg gtg gat tta ctg cgt ate 912 
Gin Leu Ser Leu Pro Val Pro Leu Leu Pro Val Asp Leu Leu Arg lie 
290 295 300 

ccg ctg ttt tea ctt tct att tgc aca tct gtt tgc tct ttc tgc gca 960 
Pro Leu Phe Ser Leu Ser He Cys Thr Ser Val Cys Ser Phe Cys Ala 
305 310 315 320 

caa atg ctg gca atg gtt tec ctg ccc ttt tac ctg caa acc gtg etc 1008 
Gin Met Leu Ala Met Val Ser Leu Pro Phe Tyr Leu Gin Thr Val Leu 
325 330 335 

ggg cgt agt gaa gtc gaa aca ggt tta ctt ctg aca ccg tgg ccg tta 1056 
Gly Arg Ser Glu Val Glu Thr Gly Leu Leu Leu Thr Pro Trp Pro Leu 
340 345 350 

gca acg atg gtg atg get ccg ctg gca ggc tat ttg att gaa cgc gta 1104 
Ala Thr Met Val Met Ala Pro Leu Ala Gly Tyr Leu lie Glu Arg Val 
355 360 365 

cat gca gga ttg ctg ggg get tta ggg ttg ttc ate atg get gcg ggg 1152 
His Ala Gly Leu Leu Gly Ala Leu Gly Leu Phe He Met Ala Ala Gly 
370 375 380 

ctt ttt tec ctg gtt ctg ctg ccc gcg tea cct gcg gat ate aat att 1200 
Leu Phe Ser Leu Val Leu Leu Pro Ala Ser Pro Ala Asp He Asn He 
385 390 395 400 

ate tgg ccg atg ate tta tgt ggt get gga ttt ggc tta ttc cag tea 1248 
He Trp Pro Met He Leu Cys Gly Ala Gly Phe Gly Leu Phe Gin Ser 
405 410 415 

ccc aat aac cac acc att att acc tec gcg cct cgc gaa cgt age ggt 1296 
Pro Asn Asn His Thr He He Thr Ser Ala Pro Arg Glu Arg Ser Gly 
420 425 430 

gga gee agt ggc atg tta gga acg get cgt eta ctg ggt cag agt age 1344 
Gly Ala Ser Gly Met Leu Gly Thr Ala Arg Leu Leu Gly Gin Ser Ser 
435 440 445 

ggc gcg gcg ctg gtg gcg ctg atg eta aat cag ttt gga gat aat ggt 1392 
Gly Ala Ala Leu Val Ala Leu Met Leu Asn Gin Phe Gly Asp Asn Gly 
450 455 460 

aca cac gtc teg ctg atg get gcg get att ctg gca gtg att get gee 1440 
Thr His Val Ser Leu Met Ala Ala Ala He Leu Ala Val He Ala Ala 
465 470 475 480 
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tgt gtc agt ggt tta cgt ate act cag cca cga tec agg gca taa 1485 
Cys Val Ser Gly Leu Arg lie Thr Gin Pro Arg Ser Arg Ala * 
485 490 



<210> 276 
<211> 1434 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1434) 

<400> 276 

atg aaa gta acg ctg cca gag ttt gaa cgt gca gga gtg atg gtg gtt 48 
Met Lys Val Thr Leu Pro Glu Phe Glu Arg Ala Gly Val Met Val Val 
1 5 10 15 

ggt gat gtg atg ctg gat cgt tac tgg tac ggc ccc acc agt cgt ate 96 
Gly Asp Val Met Leu Asp Arg Tyr Trp Tyr Gly Pro Thr Ser Arg He 
20 25 30 

teg ccg gaa gcg ccg gtg ccc gtg gtt aaa gtg aat acc ate gaa gaa 144 
Ser Pro Glu Ala Pro Val Pro Val Val Lys Val Asn Thr He Glu Glu 
35 40 45 

cgt ccg ggc ggc gcg get aac gtg gcg atg aat ate get tct etc ggt 192 
Arg Pro Gly Gly Ala Ala Asn Val Ala Met Asn He Ala Ser Leu Gly 
50 55 60 

get aat gca cgc ctg gtc ggg ttg acg ggc att gac gat gca gcg cgc 240 
Ala Asn Ala Arg Leu Val Gly Leu Thr Gly He Asp Asp Ala Ala Arg 
65 70 75 80 

gcg ctg agt aaa tct ctg gee gac gtc aac gtc aaa tgc gac ttc gtt 288 
Ala Leu Ser Lys Ser Leu Ala Asp Val Asn Val Lys Cys Asp Phe Val 
85 90 95 

tct gta ccg acg cat ccg acc att acc aaa tta egg gta ctt tec cgc 336 
Ser Val Pro Thr His Pro Thr He Thr Lys Leu Arg Val Leu Ser Arg 
100 105 110 

aac caa cag ctg ate cgt ctg gat ttt gaa gaa ggt ttc gaa ggt gtt 384 
Asn Gin Gin Leu lie Arg Leu Asp Phe Glu Glu Gly Phe Glu Gly Val 
115 ~ 120 125 

gat ccg cag ccg ctg cac gag egg att aat cag gcg ctg agt teg att 432 
Asp Pro Gin Pro Leu His Glu Arg He Asn Gin Ala Leu Ser Ser He 
130 135 140 

ggc gcg ctg gtg ctt tct gac tac gee aaa ggt gcg ctg gca age gta 480 
Gly Ala Leu Val Leu Ser Asp Tyr Ala Lys Gly Ala Leu Ala Ser Val 
145 150 155 160 

cag cag atg ate caa ctg gcg cgt aaa gcg ggt gtt ccg gtg ctg att 528 
Gin Gin Met He Gin Leu Ala Arg. Lys Ala Gly Val Pro Val Leu He 
165 170 175 
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gat cca aaa ggt acc gat ttt gag cgc tac cgc ggc get acg ctg tta 576 
Asp Pro Lys Gly Thr Asp Phe Glu Arg Tyr Arg Gly Ala Thr Leu Leu 
180 185 190 

acg ccg aat etc teg gaa ttt gaa get gtt gtc ggt aaa tgt aag acc 624 
Thr Pro Asn Leu Ser Glu Phe Glu Ala Val Val Gly Lys Cys Lys Thr 
195 200 205 

gaa gaa gag att gtt gag cgc ggc atg aaa ctg att gee gat tac gaa 672 
Glu Glu Glu lie Val Glu Arg Gly Met Lys Leu lie Ala Asp Tyr Glu 
210 215 220 

etc teg get ctg tta gtg acc cgt tec gaa cag ggt atg teg ctg ctg 720 
Leu Ser Ala Leu Leu Val Thr Arg Ser Glu Gin Gly Met Ser Leu Leu 
225 230 235 240 

caa ccg ggt aaa gcg ccg ctg cat atg cca acc caa gcg cag gaa gtg 768 
Gin Pro Gly Lys Ala Pro Leu His Met Pro Thr Gin Ala Gin Glu Val 
245 250 255 

tat gac gtt acc ggt gcg ggc gac acg gtg att ggc gtc ctg gcg gca 816 
Tyr Asp Val Thr Gly Ala Gly Asp Thr Val lie Gly Val Leu Ala Ala 
260 265 270 

acg ctg gca gcg ggt aat teg ctg gaa gaa gec tgc ttc ttt gee aat 864 
Thr Leu Ala Ala Gly Asn Ser Leu Glu Glu Ala Cys Phe Phe Ala Asn 
275 280 285 

gcg gcg get ggc gtg gtg gtc ggc aaa ctg gga acc tec acg gtt teg 912 
Ala Ala Ala Gly Val Val Val Gly Lys Leu Gly Thr Ser Thr Val Ser 
290 295 300 

ccg ate gag ctg gaa aat get gta cgt gga cgt gca gat aca ggc ttt 960 
Pro He Glu Leu Glu Asn Ala Val Arg Gly Arg Ala Asp Thr Gly Phe 
305 310 315 320 

ggc gtg atg acc gaa gag gaa ctg aag ctg gec gta gcg gca gcg cgt 1008 
Gly Val Met Thr Glu Glu Glu Leu Lys Leu Ala Val Ala Ala Ala Arg 
325 330 335 

aaa cgt ggt gaa aaa gtg gtg atg acc aac ggt gtc ttt gac ate ctg 1056 
Lys Arg Gly Glu Lys Val Val Met Thr Asn Gly Val Phe Asp He Leu 
340 345 350 

cac gee ggg cac gtc tct tat ctg gca aat gec cgc aag ctg ggt gac 1104 
His Ala Gly His Val Ser Tyr Leu Ala Asn Ala Arg Lys Leu Gly Asp 
355 360 365 

cgc ttg att gtt gec gtc aac age gat gec tec acc aaa egg ctg aaa 1152 
Arg Leu He Val Ala Val Asn Ser Asp Ala Ser Thr Lys Arg Leu Lys 
370 375 380 

999 gat tec cgc ccg gta aac cca etc gaa cag cgt atg att gtg ctg 1200 
Gly Asp Ser Arg Pro Val Asn Pro Leu Glu Gin Arg Met He Val Leu 
385 390 395 400 

ggc gca ctg gaa gcg gtc gac tgg gta gtg teg ttt gaa gag gac acg 1248 
Gly Ala Leu Glu Ala Val Asp Trp Val Val Ser Phe Glu Glu Asp Thr 
405 410 415 

ccg cag cgc ttg ate gec ggg ate ttg cca gat ctg ctg gtg aaa ggc 1296 
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Pro Gin Arg Leu He Ala Gly He Leu Pro Asp Leu Leu Val Lys Gly 
420 425 430 

ggc gac tat aaa cca gaa gag att gcc ggg agt aaa gaa gtc tgg gcc 1344 
Gly Asp Tyr Lys Pro Glu Glu He Ala Gly Ser Lys Glu Val Trp Ala 
435 440 445 

aac ggt ggc gaa gtg ttg gtg etc aac ttt gaa gac ggt tgc teg acg 1392 
Asn Gly Gly Glu Val Leu Val Leu Asn Phe Glu Asp Gly Cys Ser Thr 
450 455 460 

acc aac ate ate aag aag ate caa cag gat aaa aaa ggc taa 1434 
Thr Asn He He Lys Lys He Gin Gin Asp Lys Lys Gly * 
465 470 475 



<210> 277 
<211> 2841 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (2841) 

<400> 277 

atg aag ccg etc tct tea ccg tta cag cag tac tgg cag acc gtt gtt 48 
Met Lys Pro Leu Ser Ser Pro Leu Gin Gin Tyr Trp Gin Thr Val Val 
15 10 15 

gag egg ctg cca gag cct tta gcc gag gaa tea ctt age gca cag gcg 96 
Glu Arg Leu Pro Glu Pro Leu Ala Glu Glu Ser Leu Ser Ala Gin Ala 
20 25 30 

aag tea gta ctt act ttt agt gat ttt gtg cag gac age gtg att gcg 144 
Lys Ser Val Leu Thr Phe Ser Asp Phe Val Gin Asp Ser Val He Ala 
35 40 45 

cat cca gag tgg ctg acg gaa ctg gaa age caa ccg ccg cag gcc gac 192 
His Pro Glu Trp Leu Thr Glu Leu Glu Ser Gin Pro Pro Gin Ala Asp 
50 55 60 

gaa tgg cag cat tac gcg gca tgg ttg cag gag gcg etc tgt aat gtg 240 
Glu Trp Gin His Tyr Ala Ala Trp Leu Gin Glu Ala Leu Cys Asn Val 
65 70 75 80 

agt gac gaa gcc ggg tta atg cgc gag ctg egg eta ttc egg egg cgc 288 
Ser Asp Glu Ala Gly Leu Met Arg Glu Leu Arg Leu Phe Arg Arg Arg 
85 90 95 

* 

att atg gtg cgc ate gcc tgg gcg caa acg ctg gca ctg gtt act gaa 336 
He Met Val Arg He Ala Trp Ala Gin Thr Leu Ala Leu Val Thr Glu 
100 105 110 

gag age ata ttg cag cag etc age tat ctg gcg gag acg ctg att gtt 384 
Glu Ser He Leu Gin Gin Leu Ser Tyr Leu Ala Glu Thr Leu He Val 
115 120 125 

gcg gcg cgt gac tgg ctg tat gac gcc tgc tgc cgc gag tgg gga acg 432 
Ala Ala Arg Asp Trp Leu Tyr Asp Ala Cys Cys Arg Glu Trp Gly Thr 
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130 135 140 

ccg tgc aat gcg cag ggc gaa gcg caa ccg ctg ctg att tta ggc atg 480 
Pro Cys Asn Ala Gin Gly Glu Ala Gin Pro Leu Leu lie Leu Gly Met 
145 150 155 160 

ggt aag ctg ggc ggt ggg gag ctg aat ttc tec tct gat ate gat ctg 528 
Gly Lys Leu Gly Gly Gly Glu Leu Asn Phe Ser Ser Asp lie Asp Leu 
165 170 175 

att ttt gec tgg ccg gaa cat ggt tgt acg cag ggt gga cgc egg gaa 576 
lie Phe Ala Trp Pro Glu His Gly Cys Thr Gin Gly Gly Arg Arg Glu 
180 185 190 

ctg gat aac gcg cag ttt ttt acc cgc atg ggg cag egg ctg att aaa 624 
Leu Asp Asn Ala Gin Phe Phe Thr Arg Met Gly Gin Arg Leu lie Lys 
195 200 205 

gtg ctg gat caa cca acg cag gat ggc ttc gtc tat cgc gtg gat atg 672 
Val Leu Asp Gin Pro Thr Gin Asp Gly Phe Val Tyr Arg Val Asp Met 
210 215 220 

egg ctg cgt ccg ttt ggc gaa agt ggc ccg ctg gtg ctg age ttt gec 720 
Arg Leu Arg Pro Phe Gly Glu Ser Gly Pro Leu Val Leu Ser Phe Ala 
225 230 235 240 

gcg ttg gaa gat tat tac cag gag cag ggg cgc gac tgg gag cgt tac 768 
Ala Leu Glu Asp Tyr Tyr Gin Glu Gin Gly Arg Asp Trp Glu Arg Tyr 
245 250 ~ 255 

gcg atg gtc aag gcg egg att atg ggc gat age gaa ggc gtc tat get 816 
Ala Met Val Lys Ala Arg lie Met Gly Asp Ser Glu Gly Val Tyr Ala 
260 265 270 

aac gag ttg cgt gcg atg ctg cgc ccg ttt gtt ttc cgt cgt tac ate 864 
Asn Glu Leu Arg Ala Met Leu Arg Pro Phe Val Phe Arg Arg Tyr He 
275 280 285 

gat ttc age gtg att cag teg ctg cgc aac atg aaa ggg atg att gec 912 
Asp Phe Ser Val He Gin Ser Leu Arg Asn Met Lys Gly Met He Ala 
290 295 300 

cgt gaa gtg cgt cga cgt ggt ttg acc gac aat ate aaa etc ggc gca 960 
Arg Glu Val Arg Arg Arg Gly Leu Thr Asp Asn He Lys Leu Gly Ala 
305 310 315 320 

ggc ggc att cgc gaa att gaa ttt ate gtt cag gtg ttc cag etc att 1008 
Gly Gly He Arg Glu He Glu Phe He Val Gin Val Phe Gin Leu He 
325 330 335 

cgc ggc gga cgc gaa ccg teg ctg caa teg cgc tct tta ctg cca acg 1056 
Arg Gly Gly Arg Glu Pro Ser Leu Gin Ser Arg Ser Leu Leu Pro Thr 
340 345 350 

etc age gec att gec gag ctg cat ctg ctt tct gaa aac gat get gaa 1104 
Leu Ser Ala He Ala Glu Leu His Leu Leu Ser Glu Asn Asp Ala Glu 
355 360 365 

caa ttg cga gtg gcg tat ctg ttc ctg egg cgt ctg gaa aac ctg ctg 1152 
Gin Leu Arg Val Ala Tyr Leu Phe Leu Arg Arg Leu Glu Asn Leu Leu 
370 375 380 
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caa age att aac gac gaa caa acc cag acg ctt cct tct gat gag ctt 1200 
Gin Ser lie Asn Asp Glu Gin Thr Gin Thr Leu Pro Ser Asp Glu Leu 
385 390 395 400 

aat cgt gcg egg ctg gcg tgg gcg atg gac ttt get gac tgg ccg caa 1248 
Asn Arg Ala Arg Leu Ala Trp Ala Met Asp Phe Ala Asp Trp Pro Gin 
405 410 415 

ctg acc ggg gcg ctg acc gca cat atg acc aat gtg cgc egg gtg ttt 1295 
Leu Thr Gly Ala Leu Thr Ala His Met Thr Asn Val Arg Arg Val Phe 
420 425 430 

aat gaa ttg att ggc gac gat gaa agt gaa act cag gaa gag teg ctg 1344 
Asn Glu Leu He Gly Asp Asp Glu Ser Glu Thr Gin Glu Glu Ser Leu 
435 440 445 

teg gaa cag tgg cgt gag ctg tgg cag gat gcg ttg cag gaa gat gac 1392 
Ser Glu Gin Trp Arg Glu Leu Trp Gin Asp Ala Leu Gin Glu Asp Asp 
450 455 460 

act acg cca gtg ctg gcg cat ctt age gag gat gat cgc aaa cag gtg 1440 
Thr Thr Pro Val Leu Ala His Leu Ser Glu Asp Asp Arg Lys Gin Val 
465 470 475 480 

eta acg ctg att gee gat ttc cgc aaa gag ctg gat aag cgc acc ate I486 
Leu Thr Leu He Ala Asp Phe Arg Lys Glu Leu Asp Lys Arg Thr He 
485 490 495 

ggg ccg cga gga cgt cag gtg etc gac cat ctg atg ccg cat ctg eta 1536 
Gly Pro Arg Gly Arg Gin Val Leu Asp His Leu Met Pro His Leu Leu 
500 505 510 

agt gat gtc tgt gcg cgt gaa gac get gee gtt acg ctg teg cgc att 1584 
Ser Asp Val Cys Ala Arg Glu Asp Ala Ala Val Thr Leu Ser Arg He 
515 520 525 

acc gee ttg ctg gtg ggg att gtt acc cgc acc acc tat tta gaa ttg 1632 
Thr Ala Leu Leu Val Gly lie Val Thr Arg Thr Thr Tyr Leu Glu Leu 
530 535 540 

etc agt gaa ttc ccc gcg gcg ctt aaa cat ttg att tct ctg tgt gec 1680 
Leu Ser Glu Phe Pro Ala Ala Leu Lys His Leu He Ser Leu Cys Ala 
545 550 555 560 

gcg teg ccg atg att gec age cag ctg gcg cgt tat cca tta ttg ctg 1728 
Ala Ser Pro Met He Ala Ser Gin Leu Ala Arg Tyr Pro Leu Leu Leu 
565 570 575 

gat gaa ttg etc gat cca aac acc ctt tac cag ccg acg gcg acc gat 1776 
Asp Glu Leu Leu Asp Pro Asn Thr Leu Tyr Gin Pro Thr Ala Thr Asp 
580 585 590 

gee tac cgc gat gag ttg cgc cag tat ttg ctg cgc gtg ccg gaa gat 1824 
Ala Tyr Arg Asp Glu Leu Arg Gin Tyr Leu Leu Arg Val Pro Glu Asp 
595 . 600 605 

gac gaa gag caa cag ctt gag gcg ctg cgt cag ttc aaa cag gcg cag 1872 
Asp Glu Glu Gin Gin Leu Glu Ala Leu Arg Gin Phe Lys Gin Ala Gin 
610 615 620 
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ctg tta cgc ate gec gca gcg gat ate gee ggt acg eta ccg gtg atg 1920 
Leu Leu Arg He Ala Ala Ala Asp He Ala Gly Thr Leu Pro Val Met 
625 630 635 640 

aaa gtg age gat cac tta ace tgg ctg gcg gaa gee atg ata gat gee 1968 
Lys Val Ser Asp His Leu Thr Trp Leu Ala Glu Ala Met He Asp Ala 
645 650 655 

gtc gtt cag cag gcg tgg gtt caa atg gtt gee cgc tac ggt aag ccg 2016 
Val Val Gin Gin Ala Trp Val Gin Met Val Ala Arg Tyr Gly Lys Pro 
660 665 670 

aat cac ctg aac gaa cgc gaa ggg cgt ggt ttt gcg gtg gtc ggc tac 2064 
Asn His Leu Asn Glu Arg Glu Gly Arg Gly Phe Ala Val Val Gly Tyr 
675 680 685 

ggc aag ctg ggc ggc tgg gag tta ggc tac agt tec gat ctt gac ctt 2112 
Gly Lys Leu Gly Gly Trp Glu Leu Gly Tyr Ser Ser Asp Leu Asp Leu 
690 695 700 

ate ttc etc cat gat tgc cca atg gat gcg atg act gac ggt gag egg 2160 
He Phe Leu His Asp Cys Pro Met Asp Ala Met Thr Asp Gly Glu Arg 
705 710 715 720 

gaa ate gac ggg egg cag ttt tat ctg cgt ctg gcg caa cgc att atg 2208 
Glu He Asp Gly Arg Gin Phe Tyr Leu Arg Leu Ala Gin Arg He Met 
725 730 735 

cat ctg ttc agt acg cgt ace tct tec ggc att ttg tat gaa gtg gat 2256 
His Leu Phe Ser Thr Arg Thr Ser Ser Gly He Leu Tyr Glu Val Asp 
740 745 750 

get cga ctg cgt ccg tec ggg gcg gcg gga atg ctg gtg aca tec gca 2304 
Ala Arg Leu Arg Pro Ser Gly Ala Ala Gly Met Leu Val Thr Ser Ala 
755 760 765 

gaa gca ttt gee gat tat cag aaa aac gag gee tgg acg tgg gaa cat 2352 
Glu Ala Phe Ala Asp Tyr Gin Lys Asn Glu Ala Trp Thr Trp Glu His 
770 775 780 

cag gcg ctg gtg cgt gcg cgt gta gtg tac ggc gat ccg cag etc ace 2400 
Gin Ala Leu Val Arg Ala Arg Val Val Tyr Gly Asp Pro Gin Leu Thr 
785 790 795 800 

gcg cac ttt gac gca gtg cgt cgc gag att atg acg ctg ccg cgt gaa 2448 
Ala His Phe Asp Ala Val Arg Arg Glu He Met Thr Leu Pro Arg Glu 
805 810 815 

ggt aaa act ctg caa acg gaa gtg egg gaa atg cgc gag aaa atg cgc 2496 
Gly Lys Thr Leu Gin Thr Glu Val Arg Glu Met Arg Glu Lys Met Arg 
820 825 830 

get cat etc ggc aat aaa cat cgc gat cgc ttt gat ate aaa get gat 2544 
Ala His Leu Gly Asn Lys His Arg Asp Arg Phe Asp He Lys Ala Asp 
835 840 845 

gaa ggg gga att acc gat ate gaa ttt att acc caa tat ctg gtg ttg 2592 
Glu Gly Gly He Thr Asp He Glu Phe He Thr Gin Tyr Leu Val Leu 
850 855 1 860 

cgc tac get cat gaa aaa ccg aag tta acg cgc tgg tea gac aac gtg 2640 
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Arg Tyr Ala His Glu Lys Pro Lys Leu Thr Arg Trp Ser Asp Asn Val 
865 870 875 880 

cgt att ctg gaa eta ctg gcg caa aac gac att atg gaa gag cag gaa 2688 
Arg lie Leu Glu Leu Leu Ala Gin Asn Asp lie Met Glu Glu Gin Glu 
885 890 895 

gcg atg gcg ctg acc cgt get tac act acg ctt cgc gat gaa ctt cat 2736 
Ala Met Ala Leu Thr Arg Ala Tyr Thr Thr Leu Arg Asp Glu Leu His 
900 905 910 

cat ctg gca tta cag gaa ttg ccg ggc cat gtg teg gag gat tgc ttc 2784 
His Leu Ala Leu Gin Glu Leu Pro Gly His Val Ser Glu Asp Cys Phe 
915 920 925 

acc gca gag cgt gaa ctg gtg egg gca age tgg cag aag tgg ctg gtg 2832 
Thr Ala Glu Arg Glu Leu Val Arg Ala Ser Trp Gin Lys Trp Leu Val 
930 935 940 

gaa gaa tga 2841 

Glu Glu * 

945 



<210> 278 
<211> 1302 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (1) . . . (1302) 
<400> 278 

atg get cag gaa ate gaa tta aag ttt att gtt aat cac agt gee gtt 48 
Met Ala Gin Glu lie Glu Leu Lys Phe He Val Asn His Ser Ala Val 
15 10 15 

gag gcg ttg cgt gac cat etc aat acg ctg ggc ggc gag cac cat gac 96 
Glu Ala Leu Arg Asp His Leu Asn Thr Leu Gly Gly Glu His His Asp 
20 25 30 

ccc gtg cag ttg ctg aat att tac tac gaa acg ccg gat aac tgg ctg 144 
Pro Val Gin Leu Leu Asn He Tyr Tyr Glu Thr Pro Asp Asn Trp Leu 
35 40 45 

cgt ggg cac gat atg ggc tta cgt att cgt ggc gaa aac ggt cgc tat 192 
Arg Gly His Asp Met Gly Leu Arg He Arg Gly Glu Asn Gly Arg Tyr 
50 55 60 

gag atg acc atg aaa gtt gca gga aga gtg aca ggc ggc tta cat cag 240 
Glu Met Thr Met Lys Val Ala Gly Arg Val Thr Gly Gly Leu His Gin 
65 70 . 75 80 

cgc ccg gaa tat aac gtg gcg ttg age gaa ccg acg etc gac ctg gcg 288 
Arg Pro Glu Tyr Asn Val Ala Leu Ser Glu Pro Thr Leu Asp Leu Ala 
85 90 95 

cag tta ccg acg gaa gtc tgg ccg aac ggc gaa ttg ccc gec gat etc 336 
Gin Leu Pro Thr Glu Val Trp Pro Asn Gly Glu Leu Pro Ala Asp Leu 
100 105 110 
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gcc tec cgc gtg cag ccg ctg ttc age acc gat ttt tat cgc gaa aaa 384 
Ala Ser Arg Val Gin Pro Leu Phe Ser Thr Asp Phe Tyr Arg Glu Lys 
115 120 125 

tgg ctg gtg gcg gtc gat ggt age caa att gaa ate gcc etc gac cag 432 
Trp Leu Val Ala Val Asp Gly Ser Gin lie Glu lie Ala Leu Asp Gin 
130 135 140 

999 gaa gtg aaa gcg ggt gaa ttt get gaa cct ate tgt gag ctg gaa 480 
Gly Glu Val Lys Ala Gly Glu Phe Ala Glu Pro lie Cys Glu Leu Glu 
145 150 155 160 

ctg gaa ctg ctt age ggc gac acg cgc gcg gtg ctg aaa ctg gcg aac 528 
Leu Glu Leu Leu Ser Gly Asp Thr Arg Ala Val Leu Lys Leu Ala Asn 
165 170 175 

caa ctg gta teg caa acc gga tta cgc cag ggc age ctg age aaa gcg 576 
Gin Leu Val Ser Gin Thr Gly Leu Arg Gin Gly Ser Leu Ser Lys Ala 
180 185 190 

gcg cgt ggc tat cat ctg gcg cag ggc aat ccg gcg cgt gaa ate aaa 624 
Ala Arg Gly Tyr His Leu Ala Gin Gly Asn Pro Ala Arg Glu lie Lys 
195 200 205 

ccg acc acc att ttg cat gtt gcg gca aaa gcc gat gtg gaa cag ggg 672 
Pro Thr Thr lie Leu His Val Ala Ala Lys Ala Asp Val Glu Gin Gly 
210 215 220 

ctg gaa gcg gcg etc gag ctg gcg tta gcg caa tgg cag tat cat gaa 720 
Leu Glu Ala Ala Leu Glu Leu Ala Leu Ala Gin Trp Gin Tyr His Glu 
225 230 235 240 

gaa ctg tgg gta cgc ggc aac gat gcg gcg aaa gaa cag gtg ctg gca 768 
Glu Leu Trp Val Arg Gly Asn Asp Ala Ala Lys Glu Gin Val Leu Ala 
245 250 255 

gcc att age ctg gtc cgt cat acg ctg atg ctg ttc ggt ggt att gtg 816 
Ala lie Ser Leu Val Arg His Thr Leu Met Leu Phe Gly Gly lie Val 
260 265 270 

ccg cgt aaa gcg age act cac tta cgt gat ctg ctg act caa tgc gag 864 
Pro Arg Lys Ala Ser Thr His Leu Arg Asp Leu Leu Thr Gin Cys Glu 
275 280 285 

gcg acc att get tct gcg gtg tct gcc gtg acg gcg gtc tac tct acc 912 
Ala Thr lie Ala Ser Ala Val Ser Ala Val Thr Ala Val Tyr Ser Thr 
290 295 300 

gaa acg gca atg gcg aag ctg gcg ttg acc gaa tgg ttg gta age aaa 960 
Glu Thr Ala Met Ala Lys Leu Ala Leu Thr Glu Trp Leu Val Ser Lys 
305 310 315 320 

gca tgg cag cca ttt tta gat gcc aaa gcg cag ggc aaa ate age gac 1008 
Ala Trp Gin Pro Phe Leu Asp Ala Lys Ala Gin Gly Lys lie Ser Asp 
325 330 335 

tec ttc aaa cgc ttt gcc gat ate cat ctt tec cgc cat gcc get gaa 1056 
Ser Phe Lys Arg Phe Ala Asp lie His Leu Ser Arg His Ala Ala Glu 
340 345 350 
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ctg aaa age gtt ttc tgc cag ccg tta ggc gat cgc tac cgt gac cag 1104 
Leu Lys Ser Val Phe Cys Gin Pro I-eu Gly Asp Arg Tyr Arg Asp Gin 
355 360 365 

ttg cca cgc ctg acg cgt gat att gac tea ata ctg ttg ctg gcg ggt 1152 
Leu Pro Arg Leu Thr Arg Asp lie Asp Ser lie Leu Leu Leu Ala Gly 
370 375 380 

tac tat gat cct gtc gtc gcg caa gec tgg ctg gag aac tgg cag ggg 1200 
Tyr Tyr Asp Pro Val Val Ala Gin Ala Trp Leu Glu Asn Trp Gin Gly 
385 390 395 400 

ctg cat cac get att gcg acc ggg caa cgc ate gaa att gaa cat ttc 1248 
Leu His His Ala lie Ala Thr Gly Gin Arg lie Glu lie Glu His Phe 
405 410 415 

cgt aat gag gca aac aat cag gaa ccg ttc tgg ttg cac age gga aaa 1296 
Arg Asn Glu Ala Asn Asn Gin Glu Pro Phe Trp Leu His Ser Gly Lys 
420 425 430 

cgt taa 1302 
Arg * 



<210> 279 
<211> 624 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (624) 



<400> 279 

gtg etc aac aaa etc tec tta ctg ctg aaa gac gca ggt att teg ctt 48 
Met Leu Asn Lys Leu Ser Leu Leu Leu Lys Asp Ala Gly lie Ser Leu 
1 5 10 15 

acc gat cac cag aaa aac cag ctt att gee tac gtg aat atg ctg cat 96 
Thr Asp His Gin Lys Asn Gin Leu lie Ala Tyr val Asn Met Leu His 
20 25 30 

aaa tgg aac aaa gcg tac aac ctg act teg gtc cgc gat cct aat gag 144 
Lys Trp Asn Lys Ala Tyr Asn Leu Thr Ser Val Arg Asp Pro Asn Glu 
35 40 45 

atg ctg gta cgc cat att etc gat age att gtg gtg gca ccg tat ctg 192 
Met Leu Val Arg His lie Leu Asp Ser lie Val Val Ala Pro Tyr Leu 
50 55 60 

caa ggt gaa egg ttt ate gat gtc ggc acc gga cca gga ctg cca ggc 240 
Gin Gly Glu Arg Phe He Asp Val Gly Thr Gly Pro Gly Leu Pro Gly 
65 70 75 80 

att cca etc tct ate gtg cgt cct gaa gee cat ttc act ctg ttg gat 288 
He Pro Leu Ser He Val Arg Pro Glu Ala His Phe Thr Leu Leu Asp 
85 90 95 

age ctt ggt aaa cgc gtg cgt ttc ctt cgt cag gtg caa cat gag ctt 336 
Ser Leu Gly Lys Arg Val Arg Phe Leu Arg Gin Val Gin His Glu Leu 
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100 105 110 

aaa ctg gag aat att gaa cca gta cag age agg gta gaa gag ttt cct 384 
Lys Leu Glu Asn lie Glu Pro Val Gin Ser Arg Val Glu Glu Phe Pro 
115 120 125 

tea gag ccg cca ttt gat ggc gta att age cgc get ttt gee tct ctg 432 
Ser Glu Pro Pro Phe Asp Gly Val He Ser Arg Ala Phe Ala Ser Leu 
130 135 140 

aac gat atg gtg age tgg tgc cac cat ctt cct ggt gag caa ggc cgt 480 
Asn Asp Met Val Ser Trp Cys His His Leu Pro Gly Glu Gin Gly Arg 
145 150 155 160 

ttc tac gcg ctg aaa ggg caa atg ccg gaa gat gaa ate get ttg ttg 528 
Phe Tyr Ala Leu Lys Gly Gin Met Pro Glu Asp Glu He Ala Leu Leu 
165 170 175 

ccc gaa gaa tat cag gtc gaa tea gtg gtt aaa ctt cag gtt cca gec 576 
Pro Glu Glu Tyr Gin Val Glu Ser Val Val Lys Leu Gin Val Pro Ala 
180 185 190 

ctg gat ggc gaa cgt cat ctg gtg gtg att aaa gca aat aaa att taa 624 
Leu Asp Gly Glu Arg His Leu Val Val lie Lys Ala Asn Lys He * 
195 200 205 



<210> 280 
<211> 1890 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1890) 

<400> 280 

atg ttt tat ccg gat cct ttt gac gtc ate ate att ggc ggg ggt cat 48 

Met Phe Tyr Pro Asp Pro Phe Asp Val He He He Gly Gly Gly His 
1 5 10 ^15 

gca ggc ace gag gec gcg atg gec gcg gcg cgt atg ggt caa cag act 96 
Ala Gly Thr Glu Ala Ala Met Ala Ala Ala Arg Met Gly Gin Gin Thr 
20 25 30 

ctg ctt ttg aca cac aat ate gac act ctg ggg cag atg age tgc aac 144 
Leu Leu Leu Thr His Asn He Asp Thr Leu Gly Gin Met Ser Cys Asn 
35 40 45 

ccg gcg ate ggc ggt att ggg aag gga cat ctg gta aaa gaa gtg gat 192 
Pro Ala He Gly Gly He Gly Lys Gly His Leu Val Lys Glu Val Asp 
50 55 60 

gca etc ggc ggt ctg atg gcg aaa gcg ate gat cag gcg ggt ate cag 240 
Ala Leu Gly Gly Leu Met Ala Lys Ala He Asp Gin Ala Gly He Gin 
65 70 75 80 

ttt agg ata eta aac gca age aaa gga ccg gcg gtt cgc get acc cga 288 
Phe Arg He Leu Asn Ala Ser Lys Gly Pro Ala Val Arg Ala Thr Arg 
85 90 95 
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get cag gcg gat cgt gtg etc tac cgt cag gcg gta cgt acg gcg ctg 336 
Ala Gin Ala Asp Arg Val Leu Tyr Arg Gin Ala Val Arg Thr Ala Leu 
100 105 110 

gag aac caa ccg aac ctg atg ate ttc cag cag gcg gtt gaa gat ctt 384 
Glu Asn Gin Pro Asn Leu Met lie Phe Gin Gin Ala Val Glu Asp Leu 
115 120 125 

att gtc gaa aac gat cgc gtg gtc ggt get gtt ace caa atg gga ctg 432 
He Val Glu Asn Asp Arg Val Val Gly Ala Val Thr Gin Met Gly Leu 
130 135 140 

aag ttc cgt gec aaa gee gtc gtg etc acc gtt ggg acg ttc etc gac 480 
Lys Phe Arg Ala Lys Ala Val Val Leu Thr Val Gly Thr Phe Leu Asp 
145 150 155 160 

ggt aaa att cat ate ggt ctg gat aat tac age ggt ggc cgt get ggt 528 
Gly Lys He His He Gly Leu Asp Asn Tyr Ser Gly Gly Arg Ala Gly 
165 170 175 

gat ccg ccg tec att ccg ctt tct cgc cgt ttg cgt gaa ctg ccg ctg 576 
Asp Pro Pro Ser He Pro Leu Ser Arg Arg Leu Arg Glu Leu Pro Leu 
180 185 190 

cgc gtt ggt cgt ctg aaa acc ggg aca cca ccg cgt att gat get cga 624 
Arg Val Gly Arg Leu Lys Thr Gly Thr Pro Pro Arg He Asp Ala Arg 
195 200 205 

acc ate gac ttt age gta ctg gcg caa cag cat ggc gat aac cca atg 672 
Thr He Asp Phe Ser Val Leu Ala Gin Gin His Gly Asp Asn Pro Met 
210 215 220 

ccg gta ttc teg ttt atg ggc aat gcg tec cag cat ccc cag cag gtg 720 
Pro Val Phe Ser Phe Met Gly Asn Ala Ser Gin His Pro Gin Gin Val 
225 230 235 240 

ccg tgt tat ate act cat acc aac gag aaa acc cat gat gtg ate cgc 768 
Pro Cys Tyr He Thr His Thr Asn Glu Lys Thr His Asp Val He Arg 
245 250 255 

agt aac etc gat cgt age cca atg tac gca ggg gtg ate gaa ggt gtc 816 
Ser Asn Leu Asp Arg Ser Pro Met Tyr Ala Gly Val He Glu Gly Val 
260 265 270 

ggc cca cgc tac tgc ccg teg ate gaa gac aaa gtc atg cgc ttc gec 864 
Gly Pro Arg Tyr Cys Pro Ser He Glu Asp Lys Val Met Arg Phe Ala 
275 280 ~ 285 

gac aga aat cag cat cag ate ttc ctt gaa ccg gaa gga ctg acc tct 912 
Asp Arg Asn Gin His Gin He Phe Leu Glu Pro Glu Gly Leu Thr Ser 
290 295 300 

aat gaa att tat ccg aac ggt ate tec acc age ctg ccg ttc gat gtg 960 
Asn Glu He Tyr Pro Asn Gly He Ser Thr Ser Leu Pro Phe Asp Val 
305 310 315 320 

cag atg caa ate gtc cgc tct atg cag ggg atg gaa aac gcg aag ate 1008 
Gin Met Gin He Val Arg Ser Met Gin Gly Met Glu Asn Ala Lys He 
325 330 335 
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gtg cgt ccg ggt tat gcc att gag tat gac ttc ttc gat cct cgc gac 1056 
Val Arg Pro Gly Tyr Ala lie Glu Tyr Asp Phe Phe Asp Pro Arg Asp 
340 345 350 

ctg aaa ccg acg ctg gag age aag ttt ate cag ggg ctg ttc ttt get 1104 
Leu Lys Pro Thr Leu Glu Ser Lys Phe He Gin Gly Leu Phe Phe Ala 
355 360 365 

ggt cag att aac ggc act acc ggt tac gaa gaa gcc get gcg caa ggt 1152 
Gly Gin He Asn Gly Thr Thr Gly Tyr Glu Glu Ala Ala Ala Gin Gly 
370 375 380 

ttg ctg gcc ggt ctt aac get gcc cgt ctg tct get gac aaa gaa ggt 1200 
Leu Leu Ala Gly Leu Asn Ala Ala Arg Leu Ser Ala Asp Lys Glu Gly 
385 390 395 400 

tgg get ccg gca cgt tct cag gcg tat etc ggc gta eta gtt gat gac 1248 
Trp Ala Pro Ala Arg Ser Gin Ala Tyr Leu Gly Val Leu Val Asp Asp 
405 410 415 

ctg tgc act tta gga acc aaa gaa ccg tat cgt atg ttt act teg cgc 1296 
Leu Cys Thr Leu Gly Thr Lys Glu Pro Tyr Arg Met Phe Thr Ser Arg 
420 425 430 

gca gaa tat cgt ctg atg eta cgc gaa gat aat gcg gat ctg cgt ttg 1344 
Ala Glu Tyr Arg Leu Met Leu Arg Glu Asp Asn Ala Asp Leu Arg Leu 
435 440 445 

act gaa ate ggt cgt gaa ctg ggc ctg gtg gat gac gaa cgt tgg gcg 1392 
Thr Glu He Gly Arg Glu Leu Gly Leu Val Asp Asp Glu Arg Trp Ala 
450 455 460 

cgc ttt aac gag aaa ctt gag aat ate gag cgt gag cgt cag cgt ctg 1440 
Arg Phe Asn Glu Lys Leu Glu Asn He Glu Arg Glu Arg Gin Arg Leu 
465 470 475 480 

aaa teg acc tgg gta acc ccg teg gcg gaa get gca gcc gaa gtg aat 1488 
Lys Ser Thr Trp Val Thr Pro Ser Ala Glu Ala Ala Ala Glu Val Asn 
485 490 495 

get cac ctg act gcg ccg ctt tec cgt gaa gcc agt ggt gaa gat ctg 1536 
Ala His Leu Thr Ala Pro Leu Ser Arg Glu Ala Ser Gly Glu Asp Leu 
500 505 510 

ctg cgt cgt ccg gaa atg act tat gaa aaa tta acc acg ctg acg ccg 1584 
Leu Arg Arg Pro Glu Met Thr Tyr Glu Lys Leu Thr Thr Leu Thr Pro 
515 520 525 

ttt gcc cct gcg ttg aca gac gaa cag gcg gcg gaa cag gtt gag att 1632 
Phe Ala Pro Ala Leu Thr Asp Glu Gin Ala Ala Glu Gin Val Glu He 
530 535 540 

cag gtt aaa tac gaa ggt tat ate gcg cgc cag caa gat gag ate gaa 1680 
Gin Val Lys Tyr Glu Gly Tyr He Ala Arg Gin Gin Asp Glu He Glu 
545 550 555 560 

aag cag ctg cgt aac gag aac acc ctg eta ccc gcg aca ctg gat tac 1728 
Lys Gin Leu Arg Asn Glu Asn Thr Leu Leu Pro Ala Thr Leu Asp Tyr 
565 570 575 

cgc cag gta tec ggt ctt tct aac gaa gtg ate gcc aaa ctt aac gat 1776 
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Arg Gin Val Ser Gly Leu Ser Asn Glu Val lie Ala Lys Leu Asn Asp 
580 585 590 

cac aaa cca gcc tct ate ggc caa get teg cgt att tct ggc gtc acg 1824 
His Lys Pro Ala Ser lie Gly Gin Ala Ser Arg lie Ser Gly Val Thr 
595 600 605 

cct gcg gcc ate tec att ctg ctg gtg tgg ctg aaa aaa cag ggt atg 1872 
Pro Ala Ala lie Ser lie Leu Leu Val Trp Leu Lys Lys Gin Gly Met 
610 615 620 

ctg cgt cgt age gca taa 1890 

Leu Arg Arg Ser Ala * 

625 



<210> 281 
<211> 444 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (444) 

<400> 281 

atg gca gat ate act ctt ate age ggc age ace etc ggc ggt gcc gaa 48 
Met Ala Asp He Thr Leu He Ser Gly Ser Thr Leu Gly Gly Ala Glu 
1 5 10 15 

tat gta gca gaa cac ctg get gaa aag ctg gaa gag gcg ggt ttt ace 96 
Tyr Val Ala Glu His Leu Ala Glu Lys Leu Glu Glu Ala Gly Phe Thr 
20 25 30 

acc gaa acg ctg cac ggt ccg ctg tta gaa gat tta cct gcc tea ggg 144 
Thr Glu Thr Leu His Gly Pro Leu Leu Glu Asp Leu Pro Ala Ser Gly 
35 40 45 

ate tgg ctg gtt ate age tec acc cac ggt gcc gga gat att ccg gac 192 
He Trp Leu Val He Ser Ser Thr His Gly Ala Gly Asp He Pro Asp 
50 55 60 

aac ctt tct cct ttc tat gaa gca ttg cag gaa cag aag ccc gat ctt 240 
Asn Leu Ser Pro Phe Tyr Glu Ala Leu Gin Glu Gin Lys Pro Asp Leu 
65 70 75 80 

tct gca gtc cgc ttt ggc gca ate ggt att ggc agt cgt gaa tat gac 288 
Ser Ala Val Arg Phe Gly Ala He Gly He Gly Ser Arg Glu Tyr Asp 
85 90 95 

acc ttt tgt ggg get ate gat aaa etc gag gca gaa etc aaa aat tec 336 
Thr Phe Cys Gly Ala He Asp Lys Leu Glu Ala Glu Leu Lys Asn Ser 
100 105 110 



ggt gca aaa cag aca ggc gaa aca ctg aag ate aac att ctt gat cac 384 
Gly Ala Lys Gin Thr Gly Glu Thr Leu Lys He Asn He Leu Asp His 
115 120 125 

gac att ccg gaa gat ccg gca gaa gaa tgg ctg gga teg tgg gtt aat 432 
Asp He Pro Glu Asp Pro Ala Glu Glu Trp Leu Gly Ser Trp Val Asn 
130 135 140 
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tta etc aaa taa 444 

Leu Leu Lys * 

145 



<210> 282 
<211> 456 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (456) 

<400> 282 

9 fc 9 ggt ca 9 c 9 a aac g fct tc g ct 9T a tg gag aaa aaa atg aaa aaa ggc 48 
Met Gly Gin Arg Asn Val Ser Leu Met Glu Lys Lys Met Lys Lys Gly 
1 5 10 15 

acc gtt ctt aat tct gat att tea teg gtg ate tec cgt ctg gga cat 96 
Thr Val Leu Asn Ser Asp lie Ser Ser Val lie Ser Arg Leu Gly His 
20 25 30 

acc gat acg ctg gtg gtg tgt gat get ggt tta ccc ate ccc aaa agt 144 
Thr Asp Thr Leu Val Val Cys Asp Ala Gly Leu Pro He Pro Lys Ser 
35 40 45 

aca acg cgt ate gat atg gca tta acc cag ggt gta cct tct ttt atg 192 
Thr Thr Arg He Asp Met Ala Leu Thr Gin Gly Val Pro Ser Phe Met 
50 55 60 

cag gtg ctg ggc gtc gtc aca aat gaa atg cag gtc gag gcg gec att 240 
Gin Val Leu Gly Val Val Thr Asn Glu Met Gin Val Glu Ala Ala He 
65 70 75 80 

ate gcg gaa gag ate aaa cac cat aat ccg caa etc cac gaa acg ttg 288 
He Ala Glu Glu He Lys His His Asn Pro Gin Leu His Glu Thr Leu 
85 90 95 

etc act cac ctt gag cag ctg caa aaa cac cag gga aat acc att gaa 336 
Leu Thr His Leu Glu Gin Leu Gin Lys His Gin Gly Asn Thr He Glu 
100 105 110 

att cgt tac acc acg cat gaa caa ttc aaa caa caa acc gca gaa agt 384 
He Arg Tyr Thr Thr His Glu Gin Phe Lys Gin Gin Thr Ala Glu Ser 
115 120 125 

cag gcg gta att cgc age gga gaa tgt tct ccg tat gcg aat ate att 432 
Gin Ala Val He Arg Ser Gly Glu Cys Ser Pro Tyr Ala Asn He He 
130 135 140 

etc tgt get ggc gtg acg ttc tga 456 
Leu Cys Ala Gly Val Thr Phe * 
145 150 



<210> 283 
<211> 1506 
<212> DNA 

<213> Escherichia coli 



-318- 



WO 01/48209 



PCT/US00/34419 



<220> 

<22X> CDS 

<222> (1) ... (1506) 



<400> 283 

atg gaa gca tta ctt cag ctt aaa 
Met Glu Ala Leu Leu Gin Leu Lys 
1 5 

gta aaa gcc etc teg ggc gca gcg 
Val Lys Ala Leu Ser Gly Ala Ala 
20 



ggc ate gat aaa gcc ttc ccg ggc 48 
Gly lie Asp Lys Ala Phe Pro Gly 
10 15 

tta aat gtc tat ccg ggc cgc gtg 96 
Leu Asn Val Tyr Pro Gly Arg Val 
25 30 



atg gcg ctg gtg ggc gaa aac ggc gcg ggt aaa tec ace atg atg aaa 144 
Met Ala Leu Val Gly Glu Asn Gly Ala Gly Lys Ser Thr Met Met Lys 
35 40 45 

gtg ctt act ggc ate tat act cgc gat gcc ggt acg ctt tta tgg ctg 192 
Val Leu Thr Gly He Tyr Thr Arg Asp Ala Gly Thr Leu Leu Trp Leu 
50 55 60 

ggg aaa gaa acg aca ttt acc ggg cca aaa tct tec cag gaa gcc ggg 240 
Gly Lys Glu Thr Thr Phe Thr Gly Pro Lys Ser Ser Gin Glu Ala Gly 
65 70 75 80 

att ggg att ate cat cag gaa ctg aac ctg ate ccg cag ttg acc att 288 
He Gly He He His Gin Glu Leu Asn Leu He Pro Gin Leu Thr He 
85 90 95 

gcc gaa aac att ttc etc ggt cgt gag ttt gtt aat cgc ttt ggc aaa 336 
Ala Glu Asn He Phe Leu Gly Arg Glu Phe Val Asn Arg Phe Gly Lys 
100 ' 105 110 

att gac tgg aaa acc atg tat gcc gaa gcg gat aaa ttg ctg get aaa 384 
He Asp Trp Lys Thr Met Tyr Ala Glu Ala Asp Lys Leu Leu Ala Lys 
115 120 125 

ctt aac ctg cgc ttt aaa age gac aag ctg gtg ggc gat ctt tec ate 432 
Leu Asn Leu Arg Phe Lys Ser Asp Lys Leu Val Gly Asp Leu Ser He 
130 135 140 



ggt gac cag caa atg gtt gaa ate gcc aaa gtg ctg age ttt gag teg 
Gly Asp Gin Gin Met Val Glu He Ala Lys Val Leu Ser Phe Glu Ser 
145 150 155 160 



480 



aaa gtc ate att atg gat gaa ccg acc gat gcg ctg acc gat acc gaa 528 
Lys Val He He Met Asp Glu Pro Thr Asp Ala Leu Thr Asp Thr Glu 
165 170 175 

acc gaa tec ctg ttc cgc gtc ate cgc gag ctg aaa teg caa ggc cgc 576 
Thr Glu Ser Leu Phe Arg Val He Arg Glu Leu' Lys Ser Gin Gly Arg 
180 185 190 

ggt att gtc tat ate tec cac cgc atg aaa gaa ate ttc gag att tgc 624 
Gly He Val Tyr He Ser His Arg Met Lys Glu He Phe Glu He Cys 
195 200 205 

gat gac gtt acc gtt ttt cgt gat ggg caa ttt att get gag cgc gaa 672 
Asp Asp Val Thr Val Phe Arg Asp Gly Gin Phe He Ala Glu Arg Glu 
210 215 220 
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gtg gca tea ctg acc gaa gat teg ctg att gag atg atg gtg ggt cgc 720 
Val Ala Ser Leu Thr Glu Asp Ser Leu He Glu Met. Met Val Gly Arg 
225 230 235 240 

aag ctg gaa gat caa tat ccg cac ctg gac aaa gcg ccg gga gat ate 768 
Lys Leu Glu Asp Gin Tyr Pro His lieu Asp Lys Ala Pro Gly Asp He 
245 250 255 

cgc ctg aaa gtc gat aat etc tgc gga cct ggc gtt aac gat gtc tct 816 
Arg Leu Lys Val Asp Asn Leu Cys Gly Pro Gly Val Asn Asp Val Ser 
260 265 270 

ttt act tta cgc aaa ggc gaa att ctt ggc gtc tct ggt ttg atg ggc 864 
Phe Thr Leu Arg Lys Gly Glu He Leu Gly Val Ser Gly Leu Met Gly 
275 280 285 

gca ggt cgt acc gaa ctg atg aaa gtg etc tac ggc gca eta ccg cgc 912 
Ala Gly Arg Thr Glu Leu Met Lys Val Leu Tyr Gly Ala Leu Pro Arg 
290 295 300 

acc age ggt tac gtc acc ctg gat ggg cat gaa gtc gtt acc cgt tea 960 
Thr Ser Gly Tyr Val Thr Leu Asp Gly His Glu Val Val Thr Arg Ser 
305 310 315 320 

ccg cag gat ggc ctg gca aac ggc att gtg tat ate tec gaa gac cgt 1008 
Pro Gin Asp Gly Leu Ala Asn Gly He Val Tyr He Ser Glu Asp Arg 
325 330 335 

aaa cgt gac ggt tta gtg ttg ggc atg tea gta aaa gag aac atg teg 1056 
Lys Arg Asp Gly Leu Val Leu Gly Met Ser Val Lys Glu Asn Met Ser 
340 345 350 

ctg aca gcg ctg cgc tac ttc age cgc get ggc ggc agt ttg aag cat 1104 
Leu Thr Ala Leu Arg Tyr Phe Ser Arg Ala Gly Gly Ser Leu Lys His 
355 360 365 

gec gat gaa cag cag get gtg agt gat ttc att cgt ctg ttt aat gtg 1152 
Ala Asp Glu Gin Gin Ala Val Ser Asp Phe He Arg Leu Phe Asn Val 
370 375 ^ 380 

aaa act ccg teg atg gaa cag gca att ggt ctg ctt tec ggt ggc aat 12 00 
Lys Thr Pro Ser Met Glu Gin Ala He Gly Leu Leu Ser Gly Gly Asn 
385 390 395 400 

cag caa aaa gtg gcg att gee cgc ggt ctg atg aca cgc ccc aaa gtg 1248 
Gin Gin Lys Val Ala He Ala Arg Gly Leu Met Thr Arg Pro Lys Val 
405 410 . 415 

ttg ate ctt gat gaa cct acc cgt ggc gta gat gtc ggc gcg aaa aaa 1296 
Leu He Leu Asp Glu Pro Thr Arg Gly Val Asp Val Gly Ala Lys Lys 
420 425 430 

gag ate tat caa ctg att aac cag ttc aaa gee gat ggc ttg age ate 1344 
Glu He Tyr Gin Leu He Asn Gin Phe Lys Ala Asp Gly Leu Ser He 
435 440 445 

att ctg gtg tea teg gag atg cca gaa gta tta ggc atg age gat cgc 1392 
He Leu Val Ser Ser Glu Met Pro Glu Val Leu Gly Met Ser Asp Arg 
450 455 460 
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ate ate gtc atg cat gaa ggg cat etc age ggg gaa ttt act cgt gag 1440 
lie He Val Met His Glu Gly His Leu Ser Gly Glu Phe Thr Arg Glu 
465 470 475 480 

cag gee acc cag gaa gtg tta atg get gee get gtg ggc aag ctt aat 1488 
Gin Ala Thr Gin Glu Val Leu Met Ala Ala Ala Val Gly Lys Leu Asn 
485 490 495 

cgc gtg aat cag gag taa 1506 
Arg Val Asn Gin Glu * 
500 



<210> 284 
<211> 966 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (966) 

<400> 284 

atg aca acc cag act gtc tct ggt cgc cgt tat ttc acg aaa gcg tgg 48 
Met Thr Thr Gin Thr Val Ser Gly Arg Arg Tyr Phe Thr Lys Ala Trp 
1 5 10 15 

ctg atg gag cag aaa teg ctt ate get ctg ctg gtg ctg ate gcg att 96 
Leu Met Glu Gin Lys Ser Leu He Ala Leu Leu Val Leu He Ala He 
20 25 30 

gtc teg acg tta age ccg aac ttt ttc acc ate aat aac tta ttc aat 144 
Val Ser Thr Leu Ser Pro Asn Phe Phe Thr He Asn Asn Leu Phe Asn 
35 40 45 

att etc cag caa acc tea gtg aac gee att atg gcg gtc ggg atg acg 192 
He Leu Gin Gin Thr Ser Val Asn Ala He Met Ala Val Gly Met Thr 
50 55 60 

ctg gtg ate ctg acg teg ggc ate gac tta teg gta ggt tct ctg ttg 240 
Leu Val He Leu Thr Ser Gly He Asp Leu Ser Val Gly Ser Leu Leu 
65 70 75 80 

gcg ctg acc ggc gca gtt get gca tct ate gtc ggc att gaa gtc aat 288 
Ala Leu Thr Gly Ala Val Ala Ala Ser lie Val Gly He Glu Val Asn 
85 90 95 

gcg ctg gtg get gtc get get get etc gcg tta ggt gee gca att ggt 336 
Ala Leu Val Ala Val Ala Ala Ala Leu Ala Leu Gly Ala Ala lie Gly 
100 105 110 

gcg gta acc ggg gtg att gta gcg aaa ggt cgc gtc cag gcg ttt ate 384 
Ala Val Thr Gly Val He Val Ala Lys Gly Arg Val Gin Ala Phe He 
115 120 125 

get acg ctg gtt atg atg ctt tta ctg cgc ggc gtg acc atg gtt tat 432 
Ala Thr Leu Val Met Met Leu Leu Leu Arg Gly Val Thr Met Val Tyr 
130 135 140 

acc aac ggt age cca gtg aat acc ggc ttt act gag aac gee gat ctg 480 
Thr Asn Gly Ser Pro Val Asn Thr Gly Phe Thr Glu Asn Ala Asp Leu 
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145 150 155 160 

ttt ggc tgg ttt ggt att ggt cgt ccg ctg ggc gta ccg acg cca gtc 528 
Phe Gly Trp Phe Gly lie Gly Arg Pro Leu Gly Val Pro Thr Pro Val 
165 170 175 

tgg ate atg ggg att gtc ttc etc gcg gec tgg tac atg ctg cat cac 576 
Trp lie Met Gly lie Val Phe Leu Ala Ala Trp Tyr Met Leu His His 
180 185 190 

acg cgt ctg ggg cgt tac ate tac gcg ctg ggc ggc aac gaa gcg gca 624 
Thr Arg Leu Gly Arg Tyr He Tyr Ala Leu Gly Gly Asn Glu Ala Ala 
195 200 205 

acg cgt ctt tct ggt ate aac gtc aat aaa ate aaa ate ate gtc tat 672 
Thr Arg Leu Ser Gly He Asn Val Asn Lys He Lys He He Val Tyr 
210 215 220 

tct ctt tgt ggt ctg ctg gca teg ctg gec ggg ate att gaa gtg gcg 720 
Ser Leu Cys Gly Leu Leu Ala Ser Leu Ala Gly He He Glu Val Ala 
225 230 235 240 

cgt etc tec tec gca caa ccc acg gcg ggg act ggc tat gag ctg gat 768 
Arg Leu Ser Ser Ala Gin Pro Thr Ala Gly Thr Gly Tyr Glu Leu Asp 
245 250 255 

get att get gcg gtg gtt ctg ggc ggt acg agt ctg gcg ggc gga aaa 816 
Ala He Ala Ala Val Val Leu Gly Gly Thr Ser Leu Ala Gly Gly Lys 
260 265 270 

ggt cgc att gtt ggg acg ttg ate ggc gca tta att ctt ggc ttc ctt 864 
Gly Arg He Val Gly Thr Leu He Gly Ala Leu He Leu Gly Phe Leu 
275 280 285 

aat aat gga ttg aat ttg tta ggt gtt tec tec tat tac cag atg ate 912 
Asn Asn Gly Leu Asn Leu Leu Gly Val Ser Ser Tyr Tyr Gin Met He 
290 295 300 

gtc aaa gcg gtg gtg att ttg ctg gcg gtg ctg gta gac aac aaa aag 960 
Val Lys Ala Val Val He Leu Leu Ala Val Leu Val Asp Asn Lys Lys 
305 310 315 320 

cag taa 966 
Gin * 



<210> 285 
<211> 891 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
* <222> (1) . . . (891) 

<400> 285 

atg aac atg aaa aaa ctg get acc ctg gtt tec get gtt gcg eta age 48 

Met Asn Met Lys Lys Leu Ala Thr Leu Val Ser Ala Val Ala Leu Ser 
15 10 15 
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gcc acc gtc agt gcg aat gcg atg gca aaa gac acc ate gcg ctg gtg 96 
Ala Thr Val Ser Ala Asn Ala Met Ala Lys Asp Thr lie Ala Leu Val 
20 25 30 

gtc tec acg ctt aac aac ccg ttc ttt gta teg ctg aaa gat ggc gcg 144 
Val Ser Thr Leu Asn Asn Pro Phe Phe Val Ser Leu Lys Asp Gly Ala 
35 40 45 

cag aaa gag gcg gat aaa ctt ggc tat aac ctg gtg gtg ctg gac tec 192 
Gin Lys Glu Ala Asp Lys Leu Gly Tyr Asn Leu Val Val Leu Asp Ser 
50 55 60 

cag aac aac ccg gcg aaa gag ctg gcg aac gtg cag gac tta acc gtt 240 
Gin Asn Asn Pro Ala Lys Glu Leu Ala Asn Val Gin Asp Leu Thr Val 
65 70 75 80 

cgc ggc aca aaa att ctg ctg att aac ccg acc gac tec gac gca gtg 288 
Arg Gly Thr Lys lie Leu Leu He Asn Pro Thr Asp Ser Asp Ala Val 
85 90 95 

ggt aat get gtg aag atg get aac cag gcg aac ate ccg gtt ate act 336 
Gly Asn Ala Val Lys Met Ala Asn Gin Ala Asn He Pro Val He Thr 
100 105 110 

ctt gac cgc cag gca acg aaa ggt gaa gtg gtg age cac att get tct 384 
Leu Asp Arg Gin Ala Thr Lys Gly Glu Val Val Ser His He Ala Ser 
115 120 125 

gat aac gta ctg ggc ggc aaa ate get ggt gat tac ate gcg aag aaa 432 
Asp Asn Val Leu Gly Gly Lys He Ala Gly Asp Tyr He Ala Lys Lys 
130 135 140 

gcg ggt gaa ggt gcc aaa gtt ate gag ctg caa ggc att get ggt aca 480 
Ala Gly Glu Gly Ala Lys Val He Glu Leu Gin Gly He Ala Gly Thr 
145 150 155 160 

tec gca gcc cgt gaa cgt ggc gaa ggc ttc cag cag gcc gtt get get 528 
Ser Ala Ala Arg Glu Arg Gly Glu Gly Phe Gin Gin Ala Val Ala Ala 
165 170 175 

cac aag ttt aat gtt ctt gcc age cag cca gca gat ttt gat cgc att 576 
His Lys Phe Asn Val Leu Ala Ser Gin Pro Ala Asp Phe Asp Arg He 
180 185 190 

aaa ggt ttg aac gta atg cag aac ctg ttg acc get cat ccg gat gtt 624 
Lys Gly Leu Asn Val Met Gin Asn Leu Leu Thr Ala His Pro Asp Val 
195 200 205 

cag get gta ttc gcg cag aat gat gaa atg gcg ctg ggc gcg ctg cgc 672 
Gin Ala Val Phe Ala Gin Asn Asp Glu Met Ala Leu Gly Ala Leu Arg 
210 215 220 

gca ctg caa act gcc ggt aaa teg gat gtg atg gtc gtc gga ttt gac 720 
Ala Leu Gin Thr Ala Gly Lys Ser Asp Val Met Val Val Gly Phe Asp 
225 230 235 240 

ggt aca ccg gat ggc gaa aaa gcg gtg aat gat ggc aaa eta gca gcg 768 
Gly Thr Pro Asp Gly Glu Lys Ala Val Asn Asp Gly Lys Leu Ala Ala 
245 250 255 

act ate get cag eta ccc gat cag att ggc gcg aaa ggc gtc gaa acc 816 
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Thr lie Ala Gin Leu Pro Asp Gin lie Gly Ala Lys Gly Val Glu Thr 
260 265 ' 270 

gca gat aaa gtg ctg aaa ggc gag aaa gtt cag get aag tat ccg gtt 864 
Ala Asp Lys Val Leu Lys Gly Glu Lys Val Gin Ala Lys Tyr Pro Val 
275 280 285 

gat ctg aaa ctg gtt gtt aag cag tag 891 
Asp Leu Lys Leu Val Val Lys Gin * 
290 295 



<210> 286 
<211> 930 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (930) 

<400> 286 

atg caa aac gca ggc age etc gtt gtt ctt ggc age att aat get gac 48 
Met Gin Asn Ala Gly Ser Leu Val Val Leu Gly Ser lie Asn Ala Asp 
15 10 15 

cac att ctt aat ctt caa tct ttt cct act cca ggc gaa ace gta ace 96 
His lie Leu Asn Leu Gin Ser Phe Pro Thr Pro Gly Glu Thr Val Thr 
20 25 30 

ggt aac cac tat cag gtt gca ttt ggc ggc aaa ggc gcg aat cag get 144 
Gly Asn His Tyr Gin Val Ala Phe Gly Gly Lys Gly Ala Asn Gin Ala 
35 40 45 

gtg get get ggg cgt age ggt gcg aat ate gcg ttt att gee tgt acg 192 
Val Ala Ala Gly Arg Ser Gly Ala Asn He Ala Phe He Ala Cys Thr 
50 55 60 

ggt gat gac age att ggt gag age gtt cgc cag cag etc gee act gat 240 
Gly Asp Asp Ser He Gly Glu Ser Val Arg Gin Gin Leu Ala Thr Asp 
65 70 75 80 

aac att gat att act ccg gtc age gtg ate aaa ggc gaa tea aca ggt 288 
Asn He Asp He Thr Pro Val Ser Val He Lys Gly Glu Ser Thr Gly 
85 90 95 

gtg gcg ctg att ttt gtt aat ggc gaa ggt gag aat gtc ate ggt att 336 
Val Ala Leu He Phe Val Asn Gly Glu Gly Glu Asn Val He Gly He 
100 105 110 

cat gee ggc get aat get gec ctt tec ccg gcg ctg gtg gaa gcg caa 384 
His Ala Gly Ala Asn Ala Ala Leu Ser Pro Ala Leu Val Glu Ala Gin 
115 120 125 

cgt gag cgt att gee aac gcg tea gca tta tta atg cag ctg gaa tea 432 
Arg Glu Arg He Ala Asn Ala Ser Ala Leu Leu Met Gin Leu Glu Ser 
130 135 140 

cca etc gaa agt gtg atg gca gcg gcg aaa ate gee cat caa aat aag 480 
Pro Leu Glu Ser Val Met Ala Ala Ala Lys He Ala His Gin Asn Lys 
145 150 155 160 
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act ate gtt gcg ctt aac ccg get ccg get cgc gaa ctt cct gac gaa 528 
Thr He Val Ala Leu Asn Pro Ala Pro Ala Arg Glu Leu Pro Asp Glu 
165 170 175 

ctg ctg gcg ctg gtg gac att att acg cca aac gaa acg gaa gca gaa 576 
Leu Leu Ala Leu Val Asp He He Thr Pro Asn Glu Thr Glu Ala Glu 
180 185 190 

aag etc acc ggt att cgt gtt gaa aat gat gaa gat gca gcg aag gcg 624 
Lys Leu Thr Gly He Arg Val Glu Asn Asp Glu Asp Ala Ala Lys Ala 
195 200 205 

gcg cag gta ctg cat gaa aaa ggt ate cgt act gta ctg att act tta 672 
Ala Gin Val Leu His Glu Lys Gly He Arg Thr Val Leu He Thr Leu 
210 215 220 

gga agt cgt ggt gta tgg get age gtg aat ggt gaa ggt cag cgc gtt 720 
Gly Ser Arg Gly Val Trp Ala Ser Val Asn Gly Glu Gly Gin Arg Val 
225 230 235 240 

cct gga ttc egg gtg cag get gtc gat acc att get gee gga gat acc 768 
Pro Gly Phe Arg Val Gin Ala Val Asp Thr He Ala Ala Gly Asp Thr. 

245 250 255 

ttt aac ggt gcg tta ate acg gca ttg ctg gaa gaa aaa cca ttg cca 816 
Phe Asn Gly Ala Leu He Thr Ala Leu Leu Glu Glu Lys Pro Leu Pro 
260 265 270 

gag gcg att cgt ttt gee cat get gee get gcg att gee gta aca cgt 864 
Glu Ala He Arg Phe Ala His Ala Ala Ala Ala He Ala Val Thr Arg 
275 280 285 

aaa ggc gca caa cct tec gta ccg tgg cgt gaa gag ate gac gca ttt 912 
Lys Gly Ala Gin Pro Ser Val Pro Trp Arg Glu Glu He Asp Ala Phe 
290 295 300 

tta gac agg cag agg tga 930 

Leu Asp Arg Gin Arg * 

305 



<210> 287 
<211> 38 
<212> PRT 

<213> Escherichia coli 
<400> 287 

Met Lys Val Arg Ala Ser Val Lys Lys Leu Cys Arg Asn Cys Lys He 

1 5 10 15 

Val Lys Arg Asp Gly Val He Arg Val He Cys Ser Ala Glu Pro Lys 

20 25 30 

His Lys Gin Arg Gin Gly 
35 



<210> 288 
<211> 443 
<212> PRT 

<213> Escherichia coli 
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<400> 288 



Met 


Ala 


Lys 


Gin 


Pro 


Gly 


Leu 


Asp 


Phe 


Gin 


Ser 


Ala 


Lys 


Gly 


Gly 


Leu 


1 








5 










10 










15 




Gly 


Glu 


Leu 


Lys 


Arg 


Arg 


Leu 


Leu 


Phe 


Val 


He 


Gly 


Ala 


Leu 


lie 


Val 








20 










25 










30 






Phe 


Arg 


He 


Gly 


Ser 


Phe 


He 


Pro 


He 


Pro 


Gly 


He 


Asp 


Ala 


Ala 


Val 






35 










40 










45 








Leu 


Ala 


Lys 


Leu 


Leu 


Glu 


Gin 


Gin 


Arg 


Gly 


Thr 


He 


He 


Glu 


Met 


Phe 




50 










55 










60 










Asn 


Met 


Phe 


Ser 


Gly 


Gly 


Ala 


Leu 


Ser 


Arg 


Ala 


Ser 


He 


Phe 


Ala 


Leu 


65 










70 










75 










80 


Gly 


He 


Met 


Pro 


Tyr 


He 


Ser 


Ala 


Ser 


He 


He 


He 


Gin 


Leu 


Leu 


Thr 










85 










90 










95 




Val 


Val 


His 


Pro 


Thr 


Leu 


Ala 


Glu 


He 


Lys 


Lys 


Glu 


Gly 


Glu 


Ser 


Gly 








100 










105 










110 






Arg 


Arg 


Lys 


He 


Ser 


Gin 


Tyr 


Thr 


Arg 


Tyr 


Gly 


Thr 


Leu 


Val 


Leu 


Ala 






115 










120 










125 








He 


Phe 


Gin 


Ser 


He 


Gly 


He 


Ala 


Thr 


Gly 


Leu 


Pro 


Asn 


Met 


Pro 


Gly 




130 










135 










140 










Met 


Gin 


Gly 


Leu 


Val 


He 


Asn 


Pro 


Gly 


Phe 


Ala 


Phe 


Tyr 


Phe 


Thr 


Ala 


145 










150 










155 










160 


Val 


Val 


Ser 


Leu 


Val 


Thr 


Gly 


Thr 


Met 


Phe 


Leu 


Met 


Trp 


Leu 


Gly 


Glu 










165 










170 










175 




Gin 


He 


Thr 


Glu 


Arg 


Gly 


He 


Gly 


Asn 


Gly 


He 


Ser 


He 


He 


He 


Phe 








180 










185 










190 






Ala 


Gly 


He 


Val 


Ala 


Gly 


Leu 


Pro 


Pro 


Ala 


He 


Ala 


His 


Thr 


He 


Glu 






195 










200 










205 








Gin 


Ala 


Arg 


Gin 


Gly 


Asp 


Leu 


His 


Phe 


Leu 


Val 


Leu 


Leu 


Leu 


Val 


Ala 




210 










215 










220 










Val 


Leu 


Val 


Phe 


Ala 


Val 


Thr 


Phe 


Phe 


Val 


Val 


Phe 


Val 


Glu 


Arg 


Gly 


225 








Val 


230 










235 










240 


Gin 


Arg 


Arg 


He 


Val 


Asn 


Tyr 


Ala 


Lys 


Arg 


Gin 


Gin 


Gly 


Arg 


Arg 










245 










250 










255 




Val 


Tyr 


Ala 


Ala 


Gin 


Ser 


Thr 


His 


Leu 


Pro 


Leu 


Lys 


Val 


Asn 


Met 


Ala 








260 










265 










270 






Gly 


Val 


He 


Pro 


Ala 


He 


Phe 


Ala 


Ser 


Ser 


He 


He 


Leu 


Phe 


Pro 


Ala 






275 










280 










285 








Thr 


lie 


Ala 


Ser 


Trp 


Phe 


Gly 


Gly 


Gly 


Thr 


Gly 


Trp 


Asn 


Trp 


Leu 


Thr 




290 










295 










300 










Thr 


He 


Ser 


Leu 


Tyr 


Leu 


Gin 


Pro 


Gly 


Gin 


Pro 


Leu 


Tyr 


Val 


Leu 


Leu 


305 










310 










315 










320 


Tyr 


Ala 


Ser 


Ala 


He 


He 


Phe 


Phe 


Cys 


Phe 


Phe 


Tyr 


Thr 


Ala 


Leu 


Val 










325 










330 










335 




Phe 


Asn 


Pro 


Arg 


Glu 


Thr 


Ala 


Asp 


Asn 


Leu 


Lys 


Lys 


Ser 


Gly 


Ala 


Phe 








340 










345 










350 






Val 


Pro 


Gly 


He 


Arg 


Pro 


Gly 


Glu 


Gin 


Thr 


Ala 


Lys 


Tyr 


He 


Asp 


Lys 






355 










360 










365 








val 


Met 


Thr 


Arg 


Leu 


Thr 


Leu 


Val 


Gly 


Ala 


Leu 


Tyr 


He 


Thr 


Phe 


He 




370 










375 










380 










Cys 


Leu 


He 


Pro 


Glu 


Phe 


Met 


Arg 


Asp 


Ala 


Met 


Lys 


Val 


Pro 


Phe 


Tyr 


385 










390 










395 










400 


Phe 


Gly 


Gly 


Thr 


Ser 


Leu 


Leu 


He 


Val 


Val 


Val 


Val 


He 


Met 


Asp 


Phe 










405 










410 










415 




Met 


Ala 


Gin 


Val 


Gin 


Thr 


Leu 


Met 


Met 


Ser 


Ser 


Gin 


Tyr 


Glu 


Ser 


Ala 








420 










425 










430 






Leu 


Lys 


Lys 


Ala 


Asn 


Leu 


Lys 


Gly 


Tyr 


Gly 


Arg 













435 440 



<210> 289 
<211> 144 
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<212> PRT 

<213> Escherichia coli 



<400> 289 



Met Arg 


Leu Asn Thr Leu 


Ser Pro 


Ala 


Glu Gly Ser Lys 


Lys 


Ala 


Gly 


1 


5 






10 




15 




Lys Arg 


Leu Gly Arg Gly 


lie Gly Ser 


Gly Leu Gly Lys 


Thr 


Gly 


Gly 




20 




25 




30 






Arg Gly 


His Lys Gly Gin 


Lys Ser Arg 


Ser Gly Gly Gly 


Val 


Arg 


Arg 




35 


40 




45 








Gly Phe 


Glu Gly Gly Gin 


Met Pro 


Leu 


Tyr Arg Arg Leu 


Pro 


Lys 


Phe 


50 




55 




60 








Gly Phe 


Thr Ser Arg Lys 


Ala Ala 


He 


Thr Ala Glu He 


Arg 


Leu 


Ser 


65 


70 






75 






80 


Asp Leu 


Ala Lys Val Glu 


Gly Gly Val 


Val Asp Leu Asn 


Thr 


Leu 


Lys 




85 






90 




95 




Ala Ala 


Asn lie lie Gly 


lie Gin 


He 


Glu Phe Ala Lys 


Val 


He 


Leu 




100 




105 




110 






Ala Gly 


Glu Val Thr Thr 


Pro Val 


Thr 


Val Arg Gly Leu 


Arg 


Val 


Thr 




115 


120 




125 








Lys Gly 


Ala Arg Ala Ala 


lie Glu 


Ala 


Ala Gly Gly Lys 


He 


Glu 


Glu 


130 




135 




140 








<210> 290 














<211> 59 














<212> PRT 














<213> Escherichia coli 














<400> 290 














Met Ala Lys Thr lie Lys 


lie Thr 


Gin 


Thr Arg Ser Ala 


He 


Gly 


Arg 


1 


5 






10 




15 




Leu Pro Lys His Lys Ala 


Thr Leu 


Leu 


Gly Leu Gly Leu 


Arg 


Arg 


He 




20 




25 




30 






Gly His Thr Val Glu Arg 


Glu Asp Thr 


Pro Ala He Arg 


Gly 


Met 


He 




35 


40 




45 








Asn Ala Val Ser Phe Met 


Val Lys Val 


Glu Glu 








50 




55 












<210> 291 














<211> 167 














<212> PRT 














<213> Escherichia coli 














<400> 291 














Met Ala 


His ' lie Glu Lys 


Gin Ala Gly 


Glu Leu Gin Glu 


Lys 


Leu 


He 


1 


5 






10 




15 




Ala val 


Asn Arg Val Ser 


Lys Thr Val 


Lys Gly Gly Arg 


He 


Phe 


Ser 




20 




25 










Phe Thr 


Ala Leu Thr Val 


Val Gly Asp 


Gly Asn Gly Arg 


Val 


Gly 


Phe 




35 


40 




45 








Gly Tyr 


Gly Lys Ala Arg 


Glu Val 


Pro 


Ala Ala He Gin 


Lys 


Ala 


Met 


50 




55 




60 








Glu Lys 


Ala Arg Arg Asn 


Met lie 


Asn 


Val Ala Leu Asn 


Asn 


Gly 


Thr 


65 


70 






75 






80 


Leu Gin 


His Pro Val Lys 


Gly Val His 


Thr Gly Ser Arg 


Val 


Phe 


Met 




85 






90 




95 




Gin Pro 


Ala Ser Glu Gly 


Thr Gly lie 


He Ala Gly Gly 


Ala 


Met 


Arg 




100 




105 




110 






Ala Val 


Leu Glu Val Ala 


Gly Val His 


Asn Val Leu Ala 


Lys 


Ala 


Tyr 
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115 120 125 

Gly Ser Thr Asn Pro He Asn Val Val Arg Ala Thr He Asp Gly Leu 

130 135 140 

Glu Asn Met Asn Ser Pro Glu Met Val Ala Ala Lys Arg Gly Lys Ser 
145 150 155 " ISO 

Val Glu Glu He Leu Gly Lys 
165 



<210> 292 
<211> 117 
<212> PRT 

<213> Escherichia coli 
<400> 292 



Met 


Asp 


Lys 


Lys 


Ser 


Ala Arg He 


Arg Arg Ala Thr 


Arg Ala Arg Arg 


1 








5 




10 


15 


Lys 


Leu 


Gin 


Glu 


Leu 


Gly Ala Thr 


Arg Leu Val Val 


His Arg Thr Pro 








20 






25 


30 


Arg 


His 


He 


Tyr 


Ala 


Gin Val He 


Ala Pro Asn Gly 


Ser Glu Val Leu 






35 






40 




45 


Val 


Ala 


Ala 


Ser 


Thr 


Val Glu Lys 


Ala He Ala Glu 


Gin Leu Lys Tyr 




50 








55 


60 




Thr 


Gly 


Asn 


Lys 


Asp 


Ala Ala Ala 


Ala Val Gly Lys 


Ala Val Ala Glu 


65 










70 


75 


SO 


Arg 


Ala 


Leu 


Glu 


Lys 


Gly He Lys 


Asp Val Ser Phe 


Asp Arg Ser Gly 










85 




90 


95 


Phe 


Gin 


Tyr 


His 


Gly 


Arg Val Gin 


Ala Leu Ala Asp 


Ala Ala Arg Glu 








100 






105 


110 


Ala 


Gly 


Leu 


Gin 


Phe 












115 













<210> 293 
<211> 177 
<212> PRT 

<213> Escherichia coli 



<400> 293 



Met 


Ser 


Arg 


Val 


Ala 


Lys Ala Pro 


Val Val Val Pro 


Ala Gly Val Asp 


l 








5 




10 


15 


Val 


Lys 


He 


Asn 


Gly 


Gin Val He 


Thr He Lys Gly 


Lys Asn Gly Glu 








20 






25 


30 


Leu 


Thr 


Arg 


Thr 


Leu 


Asn Asp Ala 


Val Glu Val Lys 


His Ala Asp Asn 






35 






40 




45 


Thr 


Leu 


Thr 


Phe 


Gly 


Pro Arg Asp 


Gly Tyr Ala Asp 


Gly Trp Ala Gin 




50 








55 


60 




Ala 


Gly 


Thr 


Ala 


Arg 


Ala Leu Leu 


Asn Ser Met Val 


He Gly Val Thr 


65 










70 


75 


80 


Glu 


Gly 


Phe 


Thr 


Lys 


Lys Leu Gin 


Leu Val Gly Val 


Gly Tyr Arg Ala 










85 




90 


95 


Ala 


Val 


Lys 


Gly 


Asn 


Val He Asn 


Leu Ser Leu Gly 


Phe Ser His Pro 








100 






105 


110 


Val 


Asp 


His 


Gin 


Leu 


Pro Ala Gly 


He Thr Ala Glu 


Cys Pro Thr Gin 






115 






120 




125 


Thr 


Glu 


He 


Val 


Leu 


Lys Gly Ala 


Asp Lys Gin Val 


He Gly Gin Val 




130 








135 


140 




Ala 


Ala 


Asp 


Leu 


Arg 


Ala Tyr Arg 


Arg Pro Glu Pro 


Tyr Lys Gly Lys 


145 










150 


155 


160 


Gly 


Val 


Arg 


Tyr 


Ala 


Asp Glu Val 


Val Arg Thr Lys 


Glu Ala Lys Lys 










165 




170 


175 
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Lys 



<210> 294 
<211> 130 
<212> PRT 

<213> Escherichia coli 
<400> 294 



Met 


Ser 


Met 


Gin 


Asp 


Pro He Ala Asp 


Met Leu Thr Arg He Arg Asn 


1 








5 




10 15 


Gly 


Gin 


Ala 


Ala 


Asn 


Lys Ala Ala Val 


Thr Met Pro Ser Ser Lys Leu 








20 




25 


30 


Lys 


Val 


Ala 


He 


Ala 


Asn Val Leu Lys 


Glu Glu Gly Phe He Glu Asp 






35 






40 


45 


Phe 


Lys 


Val 


Glu 


Gly 


Asp Thr Lys Pro 


Glu Leu Glu Leu Thr Leu Lys 




50 








55 


60 


Tyr 


Phe 


Gin 


Gly 


Lys 


Ala Val Val Glu 


Ser He Gin Arg Val Ser Arg 


65 










70 


75 80 


Pro 


Gly 


Leu 


Arg 


He 


Tyr Lys Arg Lys 


Asp Glu Leu Pro Lys Val Met 










85 




90 95 


Ala 


Gly 


Leu 


Gly 


He 


Ala Val Val Ser 


Thr Ser Lys Gly Val Met Thr 








100 




105 


110 


Asp 


Arg 


Ala 


Ala 


Arg 


Gin Ala Gly Leu 


Gly Gly Glu He He Cys Tyr 






115 






120 


125 


Val 


Ala 














130 













<210> 295 
<211> 101 
<212> PRT 

<213> Escherichia coli 
<400> 295 



Met 


Ala 


Lys 


Gin 


Ser Met Lys Ala Arg 


Glu Val Lys Arg Val Ala Leu 


1 








5 


10 15 


Ala 


Asp 


Lys 


Tyr 


Phe Ala Lys Arg Ala 


Glu Leu Lys Ala He He Ser 








20 


25 


30 


Asp 


val 


Asn 


Ala 


Ser Asp Glu Asp Arg 


Trp Asn Ala Val Leu Lys Leu 






35 




40 


45 


Gin 


Thr 


Leu 


Pro 


Arg Asp Ser Ser Pro 


Ser Arg Gin Arg Asn Arg Cys 




50 






55 


60 


Arg 


Gin 


Thr 


Gly 


Arg* Pro His Gly Phe 


Leu Arg Lys Phe Gly Leu Ser 


65 








70 


75 80 


Arg 


He 


Lys 


Val 


Arg Glu Ala Ala Met 


Arg Gly Glu He Pro Gly Leu 










85 


90 95 


Lys 


Lys 


Ala 


Ser 


Trp 










100 







<210> 296 
<211> 179 
<212> PRT 

<213> Escherichia coli 
<400> 296 

Met Ala Lys Leu His Asp Tyr Tyr Lys Asp Glu Val Val Lys Lys lieu 

1 5 10 15 

Met Thr Glu Phe Asn Tyr Asn Ser Val Met Gin Val Pro Arg Val Glu 
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20 






25 30 


Lys 


He 


Thr 


Leu 


Asn 


Met Gly Val 


Gly Glu Ala He Ala Asp Lys Lys 






35 






40 


45 


Leu 


Leu 


Asp 


Asn 


Ala 


Ala Ala Asp 


Leu Ala Ala He Ser Gly Gin Lys 




50 








55 


60 


Pro 


Leu 


He 


Thr 


Lys 


Ala Arg Lys 


Ser Val Ala Gly Phe Lys He Arg 


65 










70 


75 80 


Gin 


Gly 


Tyr 


Pro 


He 


Gly Cys Lys 


Val Thr Leu Arg Gly Glu Arg Met 










85 




90 95 


Trp 


Glu 


Phe 


Phe 


Glu 


Arg Leu He 


Thr He Ala Val Pro Arg He Arg 








100 






105 110 


Asp 


Phe 


Arg 


Gly 


Leu 


Ser Ala Lys 


Ser Phe Asp Gly Arg Gly Asn Tyr 






115 






120 


125 
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Gly 
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390 




Asn 


Asn 


Ser 
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<210> 304 
<211> 1325 
<212> PRT 

<213> Escherichia coli 
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Val 
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Lys 
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Glu 
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230 






Ser 


He 
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Lys 


Gly 


He 


Val 




530 










535 




Asn 


Leu 
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Lys 


Gly 


Tyr 


Val 


Thr 










205 








Tyr 


Gin 


Ala 


Gly 


Ser 


Asn 


Gly 


Gin 








220 
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Val 
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Val 
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430 
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Lys 


Ala 






475 
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He 


Val 


Asn 


Val 
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Phe 


lie 
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Gly 










525 








Asn 


He 


Ser 


Thr 


Asp 


Ser 


Leu 


Trp 
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Val 
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Val Asp 


Gly Gin 


Asn 


Ser 


Leu 


Leu 


Glu 


Thr 


Phe Asn 


Met 


Tyr 


vai Giy 




595 








oUU 








enc 
bUo 






Thr Ser 


Gly Thr 


Gly 


Thr 


Leu 


Thr 


Leu 


Thr 


Asn Asn 


Giy 


inr 


L6U Asn 


610 








615 
















Val Glu 


Gly Gly 


Glu 


Val 


Tyr 


Leu 


Gly 


val 


TYV* a Pin 

Pne biu 


Pro 


Aid 


Val Giy 


625 






con 


















Thr Leu 


Asn He 


Gly 


Ala 


Ala 


HIS 


Gly 


G1U 


Ala Ala 


TV 1 

Aid 


Asp 


Aid vsiy 
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Phe lie 


Thr Asn 


Ala 


Thr 


Lys 


Val 


Glu 


Pne 


Gly Leu 


Giy 


GlU 


Giy vai 
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c c c 
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Phe Asn 


His 


Thr 


Asn 


Asn 


Ser 


Asp 


Ala Gly 


Tyr 


Gin 


val Asp 
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Met Leu 


He Thr 


Gly 


Asp 


Asp 


Lys 


Asp 


Gly 


Lys Val 


He 


His 


Asp Ala 


690 
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Gly His 


Thr Val 


Phe 


Asn 


Ala 


Gly 


Asn 


Thr 


Tyr Ser 


Gly 


Lys 


Thr Leu 
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TO n 
/ Z. U 


Val Asn 


Asp Gly 


Leu 


Leu 


Thr 


He 


Ala 


Ser 


His Thr 


Ala 
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Gly Val 






725 
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Thr Gly 


Met Gly 
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Ser 


Glu 
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Thr 


He 


Ala Asn 
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Gly 


Thr Leu 
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n tr rv 
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•Asp lie 
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Thr 


Asn 


Ser 


Ala 


Gly 


Asp Tyr 


Thr 


Leu 


Thr Asn 
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Ala Leu 
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Leu 


Met 
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Val 


Gin lieu 


Ser 


Ser 


Ser Asp 
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Lys Met 


Phe Gly 


Phe 


Thr 


His 


Ala 


Thr 


Gly 


Thr Glu 


Phe 


TV 1 _ 

Ala 


Gly Val 
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790 










795 






SOU 


Ala Gin 


Leu Lys 


Asp 


Ser 


Thr 


Phe 


Thr 


Leu 


Glu Arg 


Asp 


Asn 


Thr Ala 
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olb 


Ala Leu 


Thr His 


Ala 


Met 


Leu 


Gin 


Ser 


Asp 


Ser Glu 


Asn 


Thr 


Thr Ser 




820 










825 








ojO 




Val Lys 


Val Gly 


Glu 


Gin 


Ser 


He 


Gly 


Gly 


Leu Ala 


Met 


Asn 


Gly Gly 




835 








840 








O A C 

84b 






Thr lie 


He Phe 


Asp 


Thr 


Asp 


He 


Pro 


Ala 


Ala Thr 


Leu 


Ala 


Glu Gly 


850 








855 








Q C fS 

860 








Tyr lie 


Ser Val 


Asp 


Thr 


Leu 


Val 


Val 


Gly 


Ala Gly 


Asp 


Tyr 


Thr Trp 


865 






870 










o fo 






oou 


Lys Gly 


Arg Asn 


Tyr 


Gin 


Val 


Asn 


Gly 


Thr 


Gly Asp 


Val 


Leu 


He Asp 






885 










890 








©yb 


Val Pro 


Lys Pro 


Trp 


Asn 


Asp 


Pro 


Met 


Ala 


Asn Asn 


Pro 


Leu 


Thr Thr 




900 










905 








910 




Leu Asn 


Leu Leu 


Glu 


His 


Asp 


Asp 


Ser 


His 


Val Gly 


Val 


Gin 


Leu Val 




915 








920 








y^b 






Lys Ala 


Gin Thr 


Val 


He 


Gly 


Ser 


Gly 


Gly 


Ser Leu 


Thr 


Leu 


Arg Asp 


930 








935 








y^u 








Leu Gin 


Gly Asp 


Glu 


Val 


GlU 


Ala 


Asp 


Lys 


Thr Leu 


His 


He 


7V "I ~ fin _ 

Aia Gin 


945 






you 


















Asn Gly 


Thr Val 


Val 


Ala 


Glu 


Gly 


Asp 


Tyr 


Gly Phe 


Arg 


Leu 


inr inr 






965 










y /u 








y / d 


Ala Pro 


Gly Asn 


Gly 


Leu 


Tyr 


Val 


Asn 


Tyr 


Gly Leu 


Lys 


Ala 


T All ^ 

ijeu Asn 




980 










.985 








990 




lie His 


Gly Gly 


Gin 


Lys 


Leu 


Thr 


Leu 


Ala 


Glu His 


Gly 


Gly 


Ala Tyr 




995 








1000 






1005 




Gly Ala 


Thr Ala 


Asp 


Met 


Ser 


Ala Lys 


He 


Gly Gly 


Glu Gly 


Asp Leu 


1010 






1015 






1020 






Ala He 


Asn Thr 


Val 


Arg 


Gin Val 


Ser 


Leu 


Ser Asn Gly Gin 


Asn Asp 


1025 






1030 








1035 






1040 


Tyr Gin Gly Ala 


Thr 


Tyr Val Gin Met 


Gly 


Thr Leu Arg Thr 


Asp Ala 






1045 








1050 






1055 


Asp Gly Ala Leu 


Gly Asn Thr Arg Glu 


Leu Asn He 


Ser 


Asn 


Ala Ala 




1060 








1065 






1070 


He Val Asp Leu Asn Gly Ser Thr Gin Thr Val Glu Thr Phe Thr Gly 
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1075 1080 1085 

Gin Met Gly Ser Thr Val Leu Phe Lys Glu Gly Ala Leu Thr Val Asa 

1090 1095 1100 

Lys Gly Gly lie Ser Gin Gly Glu Leu Thr Gly Gly Gly Asa Leu Asn 
1105 1110 1115 1120 

Val Tor Gly Gly Thr Leu Ala lie Glu Gly Leu Asn Ala Arg Tyr Asn 

1125 1130 1135 

Ala Leu Thr Ser lie Ser Pro Asn Ala Glu Val Ser Leu Asp Asn Thr 

1140 1145 1150 

Gin Gly Leu Gly Arg Gly Asn lie Ala Asn Asp Gly Leu Leu Thr Leu 

1155 1160 1165 

Lys Asn Val Thr Gly Glu Leu Arg Asn Ser He Ser Gly Lys Gly He 

1170 1175 1180 

Val Ser Ala Thr Ala Arg Thr Asp Val Glu Leu Asp Gly Asp Asn Ser 
1185 1190 1195 1200 

Arg Phe Val Gly Gin Phe Asn He Asp Thr Gly Ser Ala Leu Ser Val 

1205 1210 1215 

Asn Glu Gin Lys Asn Leu Gly Asp Ala Ser Val He Asn Asn Gly Leu 

1220 1225 1230 

Leu Thr He Ser Thr Glu Arg Ser Trp Ala Met Thr His Ser He Ser 

1235 1240 1245 

Gly Ser Gly Asp Val Thr Lys Leu Gly Thr Gly He Leu Thr Leu Asn 

1250 1255 1260 

Asn Asp Ser Ala Ala Tyr Gin Gly Thr Thr Asp He Val Gly Gly Glu 
1265 1270 1275 1280 

He Ala Phe Gly Ser Asp Ser Ala He Asn Met Ala Ser Gin His He 

1285 1290 1295 

Asn He His Asn Ser Gly Val Met Ser Gly Asn Val Thr Thr Ala Gly 

1300 1305 1310 

Asp Met Asn Val Met Pro Gly Gly Gly Thr Ala Cys Arg 
1315 1320 1325 



<210> 305 
<211> 251 
<212> PRT 

<213> Escherichia coli 
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Ala 


Gin 
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Val Glu Arg Leu 


Gly 


He 


Ser 
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Ala 
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Arg 
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Gly 


Lys 
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Lys 


Val 
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Gly Ala 


Glu 


Ala He Thr Gin 


Gin 


Arg 


Pro 
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Met 
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Gin 
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Asp 


Glu 


Lys 
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Val 


Arg 


He 


Ala 


Lys 


Ala 


Ala Ser 


Gin 


Leu Val Asn Pro 


Gly 


Glu 


Ser 
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Val 


Val 


He 


Asn 


Cys 


Gly 


Ser Thr 


Ala 


Phe Leu Leu Gly 


Arg 


Glu 


Met 
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105 




110 






Cys 


Gly 


Lys 


Pro 


Val 


Gin 


He He 


Thr 


Asn Tyr Leu Pro 


Leu 


Ala 


Asn 






115 








120 




125 








Tyr 


Leu 


He 


Asp 


Gin 


Glu 


His Asp 


Ser 


Val He He Met 


Gly 


Gly 


Gin 




130 










135 




140 








Tyr 


Asn 


Lys 


Ser 


Gin 


Ser 


He Thr 


Leu 


Ser Pro Gin Gly 


Ser 


Glu 


Asn 


145 










150 






155 






160 


Ser 


Leu 


Tyr 


Ala 


Gly 


His 


Trp Met 


Phe 


Thr Ser Gly Lys 


Gly 


Leu 


Thr 










165 








170 




175 
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Ala Glu Gly Leu Tyr Lys Thr Asp 
180 

Lys Met Leu Ser Val Val Gly Lys 
195 200 
Lys lie Gly Glu Arg Ala Gly Met 

210 215 
Asp Met Leu lie Thr Gly Lys Asn 
225 230 
Leu Glu Ala Gin Gly Val Ser He 
245 



Met Leu Thr Ala Met Ala Glu Gin 
185 190 
Leu Val Val Leu Val Asp Ser Ser 
205 

Leu Phe Ser Arg Ala Asp Gin He 
220 

Ala Asn Pro Glu He Leu Gin Gin 
235 240 
Leu Arg Val 
250 



<210> 306 
<211> 274 
<212> PRT 

<213> Escherichia coli 



<400> 306 














Met 


Thr 


Glu 


Phe 


Thr 


Thr 


Leu Leu Gin Gin 


Gly Asn Ala 


Trp Phe Phe 
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He 


Pro 


Ser 


Ala 


He 


Leu 


Leu Gly Ala Leu 


His Gly Leu 


Glu Pro Gly 
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30 
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Ser 


Lys 


Thr 


Met 


Met 


Ala Ala Phe He 


He Ala He 


Lys Gly Thr 






35 








40 


45 




He 


Lys 


Gin 


Ala 


Val 


Met 


Leu Gly Leu Ala 


Ala Thr He 


Ser His Thr 




50 










55 


60 




Ala 


Val 


Val 


Trp 


Leu 


He 


Ala Phe Gly Gly 


Met Val He 


Ser Lys Arg 


65 










70 




75 


80 


Phe 


Thr 


Ala 


Gin 


Ser 


Ala 


Glu Pro Trp Leu 


Gin Leu He 


Ser Ala Val 










85 




90 




95 


He 


He 


He 


Ser 


Thr 


Ala 


Phe Trp Met Phe 


Trp Arg Thr 


Trp Arg Gly 








100 






105 




110 


Glu 


Arg 


Asn 


Trp 


Leu 


Glu 


Asn Met His Gly 


His Asp* Tyr 


Glu His His 






115 








120 


125 




His 


His 


Asp 


His 


Glu 


His 


His His Asp His 


Gly His His 


His His His 




130 










135 


140 




Glu 


His 


Gly 


Glu 


Tyr 


Gin 


Asp Ala His Ala 


Arg Ala His 


Ala Asn Asp 


145 










150 




155 


160 


He 


Lys 


Arg 


Arg 


Phe 


Asp 


Gly Arg Glu Val 


Thr Asn Trp 


Gin He Leu 










165 




170 




175 


Leu 


Phe 


Gly 


lieu 


Thr 


Gly 


Gly Leu He Pro 


Cys Pro Ala 


Ala He Thr 








180 






185 




190 


Val 


Leu 


Leu 


He 


Cys 


He 


Gin Leu Lys Ala 


Leu Thr Leu 


Gly Ala Thr 






195 








200 


205 




Leu 


Val 


Val 


Ser 


Phe 


Ser 


He Gly Leu Ala 


Leu Thr Leu 


Val Thr Val 




210 










215 


. 220 




Gly 


Val 


Gly 


Ala 


Ala 


He 


Ser Val Gin Gin 


Val Ala Lys 


Arg Trp Ser 


225 










230 




235 


240 


Gly 


Phe 


Asn 


Thr 


Leu 


Ala 


Lys Arg Ala Pro 


Tyr Phe Ser 


Ser Leu Leu 










245 




250 




255 


He 


Gly 


Leu 


Val 


Gly 


Val 


Tyr Met Gly Val 


His Gly Phe 


Met Gly He 








260 






265 




270 



Met Arg 



<210> 307 
<211> 172 
<212> PRT 
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<400> 307 



Met 


He 


Leu 


Lys 


Ser 


Ala 


He Ser Ala Asp 


Ser 


Leu 


Leu 


Ala 
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Arg 


Ala 


Ser 
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Leu 


Arg 


Asn 


His 


Gly 


He 
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<211> 344 
<212> PRT 
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He Asp Lys Gin Gin Gly Leu Ser Cys Thr He Thr Phe Gly Lys Glu 

805 810 815 

He Asp Glu Ser Arg Asn Tyr He Cys Gin 
820 825 



<210> 310 
<211> 239 
<212> PRT 

<213> Escherichia coli 



<400> 310 
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<210> 311 
<211> 180 
<212> PRT 

<213> Escherichia coli 
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<210> 312 
<211> 95 
<212> PRT 

<213> Escherichia coli 
<400> 312 

Met Leu His Thr lieu His Arg Ser Pro Trp Leu Thr Asp Phe Ala Ala 

1 5 10 15 

Leu Leu Arg Leu Leu Ser Glu Gly Asp Glu Leu Leu Leu Leu Gin Asp 

20 25 30 

Gly Val Thr Ala Ala Val Asp Gly Asn Arg Tyr Leu Glu Ser Leu Arg 

35 40 45 

Asn Ala Pro He Lys Val Tyr Ala Leu Asn Glu Asp Leu He Ala Arg 

50 55 60 

Gly Leu Thr Gly Gin He Ser Asn Asp He He Leu He Asp Tyr Thr 
65 70 75 80 

Asp Phe Val Arg Leu Thr Val Lys His Pro Ser Gin Met Ala Trp 
85 90 95 



<210> 313 
<211> 119 
<212> PRT 

<213> Escherichia coli 
<400> 313 



Met 


Lys 


Arg 


He 


Ala 


Phe 


Val Phe Ser Thr 


Ala Pro His Gly Thr Ala 


1 








5 




10 


15 


Ala 


Gly 


Arg 


Glu 


Gly 


Leu 


Asp Ala Leu Leu 


Ala Thr Ser Ala Leu Thr 








20 






25 


30 


Asp 


Asp 


Leu 


Ala 


Val 


Phe 


Phe He Ala Asp 
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Glu 
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<212> PRT 

<213> Escherichia coli 
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Tyr Asn Ala Asn 






35 








. 40 


45 


Gin 


Leu 


Thr 


Ser 
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Val Arg Ala Trp 
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Ala Gly Arg Leu 
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Gly 


Leu 


Ala 


Ser 


Ser 
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Leu Gin Gin Gly Phe Thr 


Leu Ser Gly Leu 
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Gly 
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Val Val Gin Phe 
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<210> 315 
<211> 244 
<212> PRT 

<213> Escherichia coli 



<400> 315 
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Met 
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Glu Thr Ser Glu 
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Leu 


Asp 
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Asp 


Gin 


Arg Pro Phe Asp Gin Thr 


Asp Phe Asp He 








20 






25 


30 


Leu 


Lys 


Ser 


Tyr 


Glu 


Ala 


Val Val Asp Gly Leu Ala 


Met Leu He Gly 






35 








40 


45 


Ser 


His 


Cys 


Glu 


He 


Val 


Leu His Ser Leu Gin Asp 


Leu Lys Cys Ser 




50 










55 60 




Ala 


lie 


Arg 


He 


Ala 


Asn 


Gly Glu His Thr Gly Arg 


Lys He Gly Ser 


65 










70 


75 


80 


Pro 


lie 


Thr 


Asp 


Leu 


Ala 


Leu Arg Met Leu His Asp 


Met Thr Gly Ala 










85 




90 


95 


Asp 


Ser 


Ser 


Val 


Ser 


Lys 


Cys Tyr Phe Thr Arg Ala 


Lys Ser Gly Val 








100 






105 


110 


Leu 


Met 


Lys 


Ser 


Leu 


Thr 


He Ala He Arg Asn Arg 


Glu Gin Arg Val 






115 








120 


125 


He 


Gly 


Leu 


Leu 


Cys 


He 


Asn Met Asn Leu Asp Val 


Pro Phe Ser Gin 




130 










135 140 




He 


Met 


Ser 


Thr 


Phe 


Val 


Pro Pro Glu Thr Pro Asp 


Val Gly Ser Ser 


145 










150 


155 


160 


Val 


Asn 


Phe 


Ala 


Ser 


Ser 


Val Glu Asp Leu Val Thr 


Gin Thr Leu Glu 










165 




170 


175 


Phe 


Thr 


He 


Glu 


Glu 


Val 


Asn Ala Asp Arg Asn Val 


Ser Asn Asn Ala 








180 






185 


190 • 


Lys 


Asn 


Arg 


Gin 


lie 


Val 


Leu Asn Leu Tyr Glu Lys 


Gly He Phe Asp 






195 








200 


205 


He 


Lys 


Asp 


Ala 


He 


Asn 


Gin Val Ala Asp Arg Leu 


Asn He Ser Lys 




210 










215 220 




His 


Thr 


Val 


Tyr 


Leu 


Tyr 


He Arg Gin Phe Lys Ser 


Gly Asp Phe Gin 


225 










230 


235 


240 


Gly 


Gin 


Asp 


Lys 
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<210> 316 
<211> 84 
<212> PRT 

<213> Escherichia coli 



<400> 316 



Met Thr Asp 


Lys 


lie 


Arg 


Thr Leu Gin Gly Arg Val 


Val Ser Asp Lys 


1 




5 




' 10 


15 


Met Glu Lys 


Ser 


He 


Val 


Val Ala He Glu Arg Phe 


Val Lys His Pro 




20 






25 


30 


lie Tyr Gly 


Lys 


Phe 


He 


Lys Arg Thr Thr Lys Leu 


His Val His Asp 


35 








40 


45 


Glu As n Asn 


Glu 


Cys 


Gly 


He Gly Asp Val Val Glu 


He Arg Glu Cys 


50 








55 60 




Arg Pro Leu 


Ser 


Lys 


Thr 


Lys Ser Trp Thr Leu Val 


Arg Val Val Glu 


65 






70 


75 


80 


Lys Ala Val 


Leu 











<210> 317 
<211> 63 
<212> PRT 

<213> Escherichia coli 
<400> 317 

Met Lys Ala Lys Glu Leu Arg Glu Lys Ser Val Glu Glu Leu Asn Thr 

15 10 15 

Glu Leu Leu Asn Leu Leu Arg Glu Gin Phe Asn Leu Arg Met Gin Ala 

20 25 30 

Ala Ser Gly Gin Leu Gin Gin Ser His Leu Leu Lys Gin Val Arg Arg 

35 40 45 

Asp Val Ala Arg Val Lys Thr Leu Leu Asn Glu Lys Ala Gly Ala 
50 55 60 



<210> 318 
<211> 136 
<212> PRT 

<213> Escherichia coli 
<400> 318 



Met 


Leu 


Gin 


Pro 


Lys 


Arg 


Thr Lys Phe Arg Lys Met 


His Lys Gly Arg 


1 








5 




10 


15 


Asn 


Arg 


Gly 


Leu 


Ala 


Gin 


Gly Thr Asp Val Ser Phe 


Gly Ser Phe Gly 








20 






25 


30 


Leu 


Lys 


Ala 


Val 


Gly 


Arg 


Gly Arg Leu Thr Ala Arg 


Gin He Glu Ala 






35 








40 


45 


Ala 


Arg 


Arg 


Ala 


Met 


Thr 


Arg Ala Val Lys Arg Gin 


Gly Lys He Trp 




50 










55 60 




He 


Arg 


Val 


Phe 


Pro 


Asp 


Lys Pro He Thr Glu Lys 


Pro Leu Ala Val 


65 










70 


75 


80 


Arg 


Met 


Gly 


Lys 


Gly 


Lys 


Gly Asn Val Glu Tyr Trp 


Val Ala Leu He 










85 




90 


95 


Gin 


Pro 


Gly 


Lys 


Val 


Leu 


Tyr Glu Met Asp Gly Val 


Pro Glu Glu Leu 








100 






105 


110 


Ala 


Arg 


Glu 


Ala 


Phe 


Lys 


Leu Ala Ala Ala Lys Leu 


Pro He Lys Thr 






115 








120 


125 


Thr 


Phe 


Val 


Thr 


Lys 


Thr 


Val Met 






130 










135 
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<210> 319 
<211> 233 
<212> PRT 

<213> Escherichia coli 



<400> 319 



Met 


Gly 


Gin Lys 


Val His Pro Asn 


1 






5 


Pro 


Trp 


Asn Ser 


Thr Trp Phe Ala 






20 




Leu 


Asp 


Ser Asp 


Phe Lys Val Arg 






35 


40 


Lys 


Ala 


Ser Val 


Ser Arg He Val 




50 




55 


Arg 


Val 


Thr He 


His Thr Ala Arg 


65 






70 


Gly 


Glu 


Asp Val 


Glu Lys Leu Arg 








85 


Val 


Pro 


Ala Gin 


He Asn He Ala 






100 




Ala 


Lys 


Leu Val 


Ala Asp Ser He 






115 


120 


Met 


Phe 


Arg Arg 


Ala Met Lys Arg 




130 




135 


Gly 


Ala 


Lys Gly 


He Lys val Glu 


145 






150 


Glu 


He 


Ala Arg 


Thr Glu Trp Tyr 








165 


Thr 


Leu 


Arg Ala 


Asp He Asp Tyr 






180 




Tyr 


Gly 


Val He 


Gly Val Lys Val 






195 


200 


Gly 


Gly 


Met Ala 


Ala Val Glu Gin 




210 




215 


Lys 


Lys 


Gin Gin 


Arg Lys Gly Arg 


225 






230 



Gly lie Arg Leu Gly He Val Lys 

10 ~* 15 
Asn Thr Lys Glu Phe Ala Asp Asn 
25 30 
Gin Tyr Leu Thr Lys Glu Leu Ala 
45 

He Glu Arg Pro Ala Lys Ser He 
60 

Pro Gly He Val He Gly Lys Lys 

75 80 
Lys Val Val Ala Asp He Ala Gly 

90 95 
Glu Val Arg Lys Pro Glu Leu Asp 
105 110 
Thr Ser Gin Leu Glu Arg Arg Val 
125 

Ala Val Gin Asn Ala Met Arg Leu 
140 

Val Ser Gly Arg Leu Gly Gly Ala 
155 160 
Arg Glu Gly Arg Val Pro Leu His 

170 175 
Asn Thr Ser Glu Ala His Thr Thr 
185 190 
Trp He Phe Lys Gly Glu He Leu 
205 

Pro Glu Lys Pro Ala Ala Gin Pro 
220 

Lys 



<210> 320 
<211> 110 
<212> PRT 

<213> Escherichia coli 
<400> 320 



Met Glu 


Thr 


He 


Ala 


Lys His Arg His Ala Arg Ser Ser 


Ala Gin Lys 


1 






5 


10 


15 


Val Arg 


Leu 


Val 


Ala 


Asp Leu He Arg Gly Lys Lys Val 


Ser Gin Ala 






20 




25 


30 


Leu Asp 


He 


Leu 


Thr 


Tyr Thr Asn Lys Lys Ala Ala Val 


Leu Val Lys 




35 






40 45 




Lys Val 


Leu 


Glu 


Ser 


Ala He Ala Asn Ala Glu His Asn 


Asp Gly Ala 


50 








55 60 




Asp He 


Asp 


Asp 


Leu 


Lys Val Thr Lys He Phe Val Asp 


Glu Gly Pro 


65 








70 75 


80 


Ser Met 


Lys 


Arg 


He 


Met Pro Arg Ala Lys Gly Arg Ala 


Asp Arg He 








85 


90 


95 


Leu Lys 


Arg 


Thr 


Ser 


His He Thr Val Val Val Ser Asp 


Arg 






100 




105 


110 
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<210> 321 
<211> 92 
<212> PRT 

<213> Escherichia coli 
<400> 321 



Met Pro Arg 


Ser Leu Lys 


Lys Gly Pro Phe 


He 


Asp 


Leu 


His 


Leu 


Leu 


1 


5 


10 










15 




Lys Lys Val 


Glu Lys Ala 


Val Glu Ser Gly 


Asp 


Lys 


Lys 


Pro 


Leu 


Arg 




20 


25 








30 






Thr Trp Ser 


Arg Arg Ser 


Thr He Phe Pro 


Asa 


Met 


lie 


Gly 


Leu 


Thr 


35 




40 






45 








lie Ala Val 


His Asn Gly 


Arg Gin His Val 


Pro 


Val 


Phe 


Val 


Thr 


Asp 


50 




55 




60 










Glu Met Val 


Gly His Lys 


Leu Gly Glu Phe 


Ala 


Pro 


Thr 


Arg 


Thr 


Tyr 


65 


70 




75 










80 


Arg Gly His 


Ala Ala Asp 


Lys Lys Ala Lys 


Lys 


Lys 











85 90 



<210> 322 
<211> 273 
<212> PRT 

<213> Escherichia coli 
<400> 322 



Met 


Ala 


Val 


Val 


Lys 


Cys 


Lys Pro Thr Ser 


Pro 


Gly 


Arg 


Arg 


His 


Val 


1 








5 




10 










15 




Val 


Lys 


Val 


Val 


Asn 


Pro 


Glu Leu His Lys 


Gly 


Lys 


Pro 


Phe 


Ala 


Pro 








20 






25 








30 






Leu 


Leu 


Glu 


Lys 


Asn 


Ser 


Lys Ser Gly Gly 


Arg 


Asn 


Asn 


Asn 


Gly 


Arg 






35 








40 






45 








He 


Thr 


Thr 


Arg 


His 


He 


Gly Gly Gly His 


Lys 


Gin 


Ala 


Tyr 


Arg 


He 




50 










55 




60 










Val 


Asp 


Phe 


Lys 


Arg 


Asn 


Lys Asp Gly He 


Pro 


Ala 


Val 


Val 


Glu 


Arg 


65 










70 




75 










80 


Leu 


Glu 


Tyr 


Asp 


Pro 


Asn 


Arg Ser Ala Asn 


He 


Ala 


Leu 


Val 


Leu 


Tyr 










85 




90 










95 




Lys 


Asp 


Gly 


Glu 


Arg 


Arg 


Tyr He Leu Ala 


Pro 


Lys 


Gly 


Leu 


Lys 


Ala 








100 






105 








110 






Gly 


Asp 


Gin 


He 


Gin 


Ser 


Gly Val Asp Ala 


Ala 


He 


Lys 


Pro 


Gly 


Asn 






115 








120 






125 








Thr 


Leu 


Pro 


Met 


Arg 


Asn 


He Pro Val Gly 


Ser 


Thr 


Val 


His 


Asn 


Val 




130 










135 




140 










Glu 


Met 


Lys 


Pro 


Gly 


Lys 


Gly Gly Gin Leu 


Ala 


Arg 


Ser 


Ala 


Gly 


Thr 


145 










150 




155 










160 


Tyr 


Val 


Gin 


He 


Val 


Ala 


Arg Asp Gly Ala 


Tyr 


Val 


Thr 


Leu 


Arg 


Leu 










165 




170 










175 




Arg 


Ser 


Gly 


Glu 


Met 


Arg 


Lys Val Glu Ala 


Asp 


Cys 


Arg 


Ala 


Thr 


Leu 








180 






185 








190 






Gly 


Glu 


Val 


Gly 


Asn 


Ala 


Glu His Met Leu 


Arg 


Val 


Leu 


Gly 


Lys 


Ala 






195 








200 






205 








Gly 


Ala 


Ala 


Arg 


Trp 


Arg 


Gly Val Arg Pro 


Thr 


Val 


Arg 


Gly 


Thr 


Ala 




210 










215 




220 










Met 


Asn 


Pro 


Val 


Asp 


His 


Pro His Gly Gly 


Gly 


Glu 


Gly 


Arg 


Asn 


Phe 


225 










230 




235 










240 


Gly 


Lys 


His 


Pro 


Val 


Thr 


Pro Trp Gly Val 


Gin 


Thr 


Lys 


Gly 


Lys 


Lys 










245 




250 










255 




Thr 


Arg 


Ser 


Asn 


Lys 


Arg 


Thr Asp Lys Phe 


He 


Val 


Arg 


Arg 


Arg 


Ser 








260 






265 








270 
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Lys 



<210> 323 
<211> 100 
<212> PRT 

<213> Escherichia coli 
<400> 323 

Met lie Arg Glu Glu Arg Leu Leu Lys Val Leu Arg Ala Pro His Val 

1 5 10 15 

Ser Glu Lys Ala Ser Thr Ala Met Glu Lys Ser Asn Thr lie Val Leu 

20 25 30 

Lys Val Ala Lys Asp Ala Thr Lys Ala Glu lie Lys Ala Ala Val Gin 

35 " 40 . 45 

Lys Leu Phe Glu Val Glu Val Glu Val Val Asn Thr Leu Val Val Lys 

50 55 60 

Gly Lys Val Lys Arg His Gly Gin Arg He Gly Arg Arg Ser Asp Trp 
65 70 75 80 

Lys Lys Ala Tyr Val Thr Leu Lys Glu Gly Gin Asn Leu Asp Phe Val 
85 90 95 

Gly Gly Ala Glu 
100 



<210> 324 
<211> 201 
<212> PRT 

<213> Escherichia coli 
<400> 324 

Met Glu Leu Val Leu Lys Asp Ala Gin Ser Ala Leu Thr Val Ser Glu 

1 5 10 15 

Thr Thr Phe Gly Arg Asp Phe Asn Glu Ala Leu Val His Gin Val Val 

20 25 30 

Val Ala Tyr Ala Ala Gly Ala Arg Gin Gly Thr Arg Ala Gin Lys Thr 

35 40 45 

Arg Ala Glu Val Thr Gly Ser Gly Lys Lys Pro Trp Arg Gin Lys Gly 

50 55 60 

Thr Gly Arg Ala Arg Ser Gly Ser He Lys Ser Pro He Trp Arg Ser 
65 70 75 80 

Gly Gly Val Thr Phe Ala Ala Arg Pro Gin Asp His Ser Gin Lys Val 

85 90 95 

Asn Lys Lys Met Tyr Arg Gly Ala Leu Lys Ser He Leu Ser Glu Leu 

100 ~ 105 110 

Val Arg Gin Asp Arg Leu He Val Val Glu Lys Phe Ser Val Glu Ala 

115 120 125 

Pro Lys Thr Lys Leu Leu Ala Gin Lys Leu Lys Asp Met Ala Leu Glu 

130 135 140 

Asp Val Leu He He Thr Gly Glu Leu Asp Glu Asn Leu Phe Leu Ala 
145 150 * 155 160 

Ala Arg Asn Leu His Lys Val Asp Val Arg Asp Ala Thr Gly He Asp 

165 170 175 

Pro Val Ser Leu He Ala Phe Asp Lys Val Val Met Thr Ala Asp Ala 

180 185 190 

Val Lys Gin Val Glu Glu Met Leu Ala 
195 200 



<210> 325 
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<211> 209 


<212> PRT 


<213> Escherichia coli 


<400> 325 


Met 


lie. 


Gly Leu Val Gly Lys Lys 


1 




5 


Glu 


Asp 


Gly Val Ser He Pro Val 






20 


Arg 


Val 


Thr Gin Val Lys Asp Leu 






35 40 


Gin 


Val 


Thr Thr Gly Ala Lys Lys 




50 


55 


Ala 


Gly 


His Phe Ala Lys Ala Gly 


65 




70 


Glu 


Phe 


Arg Leu Ala Glu Gly Glu 






85 


Ser 


Val 


Glu Leu Phe Ala Asp Val 






ioo 


Ser 


Lys 


Gly Lys Gly Phe Ala Gly 






115 120 


Thr 


Gin 


Asp Ala Thr His Gly Asn 




130 


135 


Ser 


He 


Gly Gin Asn Gin Thr Pro 


145 




150 


Met 


Ala 


Gly Gin Met Gly Asn Glu 






165 


Val 


Val 


Arg Val Asp Ala Glu Arg 






180 


Val 


Pro 


Gly Ala Thr Gly Ser Asp 






195 200 


Ala 







Val Gly Met 


Thr Arg He Phe 


Thr 


10 


15 




Thr Val He 


Glu Val Glu Ala 


Asn 


25 


30 




Ala Asn Asp 


Gly Tyr Arg Ala 


He 




45 




Ala Asn Arg 


Val Thr Lys Pro 


Glu 




60 




Val Glu Ala 


Gly Arg Gly Leu 


Trp 


75 




80 


Glu Phe Thr 


Val Gly Gin Ser 


He 


90 


95 




Lys Lys Val 


Asp Val Thr Gly 


Thr 


105 


110 




Thr Val Lys 


Arg Trp Asn Phe 


Arg 




125 




Ser Leu Ser 


His Arg Val Pro 


Gly 




140 




Gly Lys Val 


Phe Lys Gly Lys 


Lys 


155 




160 


Arg Val Thr 


Val Gin Ser Leu 


Asp 


170 


175 




Asn Leu Leu 


Leu Val Lys Gly 


Ala 


185 


190 




Leu He Val 


Lys Pro Ala Val 


Lys 




205 





<210> 326 
<211> 103 
<212> PRT 

<213> Escherichia coli 
<400> 326 



Met 


Gin 


Asn 


Gin 


Arg 


He Arg He Arg Leu Lys 


Ala Phe Asp His 


Arg 


1 








5 


10 


15 




Leu 


He 


Asp 


Gin 


Ala 


Thr Ala Glu He Val Glu 


Thr Ala Lys Arg 


Thr 








20 




25 


30 




Gly 


Ala 


Gin 


Val 


Arg 


Gly Pro He Pro Leu Pro 


Thr Arg Lys Glu 


Arg 






35 






40 


45 




Phe 


Thr 


Val 


Leu 


He 


Ser Pro His Val Asn Lys 


Asp Ala Arg Asp 


Gin 




50 








55 


60 




Tyr 


Glu 


He 


Arg 


Thr 


His Leu Arg Leu Val Asp 


He Val Glu Pro 


Thr 


65 










70 75 




80 


Glu 


Lys 


Thr 


Val 


Asp 


Ala Leu Met Arg Leu Asp 


Leu Ala Ala Gly 


Val 










85 


90 






Asp 


Val 


Gin 


He 


Ser 


Leu Gly 












100 











<210> 327 
<211> 104 
<212> PRT 

<213> Escherichia coli 



-350- 



WO 01/48209 



PCT/DS00/34419 



<400> 327 





lie 




Lys 


Ala Phe 


Val 


Met 


Gin 


Val 


Asn Pt"o Asia Ala His 


Glu 


1 








5 








10 


15 




Glu 


Tvr 


Gin 


m y 

20 


Arg His 


Asn 


Pro 


lie 

25 


Tro 


Pro Glu Leu Glu Ala 
30 


Val 


Leu 


Lys 


Ser 
35 




\DXy Aid 




Asn 
40 


Tyr 




Tift T*\ jt» Tmi Jen T.vrc 

-L-Lv2 iyr jjtsu, nop Jjyo 
45 




Arg 


Asn 
50 


Leu 


Leu 


Phe Ala 


Met 
55 


Val 


Glu 


He 


Glu Ser Glu Glu Arg 
60 


Trp 


Asn 


Ala 


Val 


Ala 


Ser Thr 


Asp 


Val 


Cys 


Gin 


Arg Trp Trp Lys Tyr 


Met 


65 








70 










75 


80 


Thr 


Asp 


Val 


Met 


Pro Ala 
85 


Asn 


Pro 


Asp 


Asn 
90 


Ser Pro Val Ser Ser 
95 


Glu 


Leu 


Gin 


Glu 


Val 
100 


Phe Tyr 


Leu 


Pro 











<210> 328 

<211> 287 

<212> PRT 

<213> Kscherichia coli 



<400> 328 



wet 


i. j.e 


Arg 


Ser 


Ma 4* 


lnx 




Tyr 




Arg 


Aig V3j.il j.j.e j_iys Lr±y 


i7j.11 










c 
□ 










J.U 


J.D 




rrp 


\s±y 


Ser 


AJLa 


Thr 


irp 


fll it 


rJct 


Arg 


Ser 


vdi ash L7J.U /iz y iyr 


Leu 








OA 

z. yj 










9 R 








ulu 


iiir 


Tyr 


r*ne 


Arg 


Leu 


Pro 


\j±XJL 


V7J.I1 


rile 


nTy bcx Leu LjJ.ll rl O 


Vex J. 






35 










40 






45 




Val 


Arg 


Glu 


Arg 


He 


Arg 


Ser 


Arg 


Leu 


Thr 


Arg Gly Lys Val Glu 


Cys 




50 










55 








60 




Thr 


Leu 


Arg 


Tyr 


Glu 


Pro 


Asp 


Val 


Ser 


Ala 


Gin Gly Glu Leu He 


Leu 


65 










70 










75 


80 


Asn 


Glu 


Lys 


Leu 


Ala 


Lys 


Gin 


Leu 


Val 


Thr 


Ala Ala Asn Trp Val 


Lys 










85 










90 


95 




Met 


Gin 


Ser 


Asp 


Glu 


Gly 


Glu 


He 


Asn 


Pro 


Val Asp He Leu Arg 


Trp 








100 










105 




110 




Pro 


Gly 


Val 


Met 


Ala 


Ala 


Gin 


Glu 


Gin 


Asp 


Leu Asp Ala He Ala 


Ala 






115 










120 






125 




Glu 


He 


Leu 


Ala 


Ala 


Leu 


Asp 


Gly 


Thr 


Leu 


Asp Asp Phe He Val 


Ala 




130 










135 








140 




Arg 


Glu 


Thr 


Glu 


Gly 


Gin 


Ala 


Leu 


Lys 


Ala 


Leu He Glu Gin Arg 


Leu 


145 










150 










155 


160 


Glu 


Gly 


Val 


Thr 


Ala 


Glu 


Val 


Val 


Lys 


Val 


Arg Ser His Met Pro 


Glu 










165 










170 


175 




He 


Leu 


Gin 


Trp 


Gin 


Arg 


Glu 


Arg 


Leu 


Val 


Ala Lys Leu Glu Asp 


Ala 








180 










185 




190 




Gin 


Val 


Gin 


Leu 


Glu 


Asn 


Asn 


Arg 


Leu 


Glu 


Gin Glu Leu Val Leu 


Leu 






195 










200 






205 




Ala 


Gin 


Arg 


He 


Asp 


Val 


Ala 


Glu 


Glu 


Leu 


Asp Arg Leu Glu Ala 


His 




210 










215 








220 




Val 


Lys 


Glu 


Thr 


Tyr 


Asn 


He 


Leu 


Lys 


Lys 


Lys Glu Ala Val Gly 


Arg 


225 










230 










235 


240 


Arg 


Leu 


Asp 


Phe 


Met 


Met 


Gin 


Glu 


Phe 


Asn 


Arg Glu Ser Asn Thr 


Leu 










245 










250 


255 




Ala 


Ser 


Lys 


Ser 


He 


Asn 


Ala 


Glu 


Val 


Thr 


Asn Ser Ala He Glu 


Leu 








260 










265 




270 




Lys 


Val 


Leu 


He 


Glu 


Gin 


Met 


Arg 


Glu 


Gin 


lie Gin Asn He Glu 








275 










280 






285 
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<210> 329 
<211> 163 
<212> PRT 

<213> Escherichia coli 



<400> 329 



Met 


Ser 


Thr 


Glu 


Thr 


He 


Glu 


He 


Phe 


Asn 


Asn 


Ser 


Asp 


Glu 


Trp 


Ala 


1 








5 










10 










15 




Asn 


Gin 


Leu 


Lys 
20 


His 


Ala 


Leu 


Ser 


Lys 
25 


Gly 


Glu 


Asn 


Leu 


Ala 
30 


Leu 


Leu 


His 


Gly 


Leu 
35 


Thr 


Pro 


Asp 


He 


Leu 
40 


Asp 


Arg 


He 


Tyr 


Ala 
45 


Tyr 


Ala 


Phe 


Asp 


Tyr 
50 


His 


Glu 


Lys 


Gly 


Asn 
55 


He 


Thr 


Asp 


Ala 


Glu 
60 


He 


Tyr 


Tyr 


Lys 


Phe 


Leu 


Cys 


He 


Tyr 


Ala 


Phe 


Glu 


Asn 


His 


Glu 


Tyr 


Leu 


Lys 


Asp 


Phe 


65 










70 










75 










80 


Ala 


Ser 


Val 


Cys 


Gin 
85 


Pro 


Lys 


Lys 


Lys 


Tyr 
90 


Gin 


Gin 


Ala 


Tyr 


Asp 
95 


Leu 


Tyr 


Lys 


Leu 


Ser 
100 


Tyr 


Asn 


Tyr 


Phe 


Pro 
105 


Tyr 


Asp 


Asp 


Tyr 


Ser 
110 


Val 


He 


Tyr 


Arg 


Met 
115 


Gly 


Gin 


Cys 


Gin 


lie 
120 


Gly 


Ala 


Lys 


Asn 


He 
125 


Asp 


Asn 


Ala 


Met 


Gin 
130 


Cys 


Phe 


Tyr 


His 


He 
135 


He 


Asn 


Asn 


Cys 


Glu 
140 


Asp 


Asp 


Ser 


Val 


Lys 


Ser 


Lys 


Ala 


Gin 


Ala 


Tyr 


He 


Glu 


Leu 


Leu 


Asn 


Asp 
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245 




Ala 


Leu 


Leu 


Gly 


Arg 


Met 








260 






Ala 


Glu 


Glu 


Met 


Gly 


Val 






275 








Gly 


Ser 


He 


Thr 


Ser 


Val 




290 










Thr 


Asp 


Pro 


Ser 


Pro 


Ala 


305 










310 


Val 


Leu 


Ser 


Arg 


Gin 


He 










325 




Pro 


Leu 


Asp 


Ser 


Thr 


Ser 








340 






Glu 


His 


Tyr 


Asp 


Thr 


Ala 






355 








Gin 


Glu 


Leu 


Lys 


Asp 


He 




370 










Glu 


Glu 


Asp 


Lys 


Leu 


Val 


385 










390 


Leu 


Ser 


Gin 


Pro 


Phe 


Phe 



Thr 


Lys 


Lys 


Ala 


Met 




135 












Val 


Gin 


Val 


He 


Gly 


Ala 








10 






Val 


Pro 


Arg 


Val 


Tyr 


Asp 






25 








Leu 


Val 


Leu 


Glu 


Val 


Gin 




40 










He 


Ala 


Met 


Gly 


Ser 


Ser 


55 










60 


Asp 


Leu 


Glu 


His 


Pro 


He 










75 




Arg 


He 


Met 


Asn 


Val 


Leu 








90 






Gly 


Glu 


Glu 


Glu 


Arg 


Trp 






105 








Glu 


Leu 


Ser 


Asn 


Ser 


Gin 




120 










Asp 


Leu 


Met 


Cys 


Pro 


Phe 


135 










140 


Gly 


Ala 


Gly 


Val 


Gly 


Lys 










155 




He 


Ala 


He 


Glu 


His 


Ser 








170 






Arg 


Thr 


Arg 


Glu 


Gly 


Asn 






185 








Val 


He 


Asp 


Lys 


Val 


Ser 




200 










Gly 


Asn 


Arg 


Leu 


Arg 


Val 


215 










220 


Phe 


Arg 


Asp 


Glu 


Gly 


Arg 










235 




Arg 


Tyr 


Thr 


Leu 


Ala 


Gly 








250 






Pro 


Ser 


Ala 


Val 


Gly 


Tyr 






265 








Leu 


Gin 


Glu 


Arg 


He 


Thr 




280 










Gin 


Ala 


Val 


Tyr 


Val 


Pro 


295 










300 


Thr 


Thr 


Phe 


Ala 


His 


Leu 










315 




Ala 


Ser 


Leu 


Gly 


He 


Tyr 








330 






Arg 


Gin 


Leu 


Asp 


Pro 


Leu 






345 








Arg 


Gly 


Val 


Gin 


Ser 


He 




360 










He 


Ala 


He 


Leu 


Gly 


Met 


375 










380 


Val 


Ala 


Arg 


Ala 


Arg 


Lys 










395 




Val 


Ala 


Glu 


Val 


Phe 


Thr 
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val 


Val 


Asp 


Val 






15 




Ala 


Leu 


Glu 


Val 




30 






Gin 


Gin 


Leu 


Gly 


45 








Asp 


Gly 


Leu 


Arg 


Glu 


Val 


Pro 


Val 








80 


Gly 


Glu 


Pro 


Val 






95 




Ala 


He 


His 


Arg 




110 






Glu 


Leu 


Leu 


Glu 


125 








Ala 


Lys 


Gly 


Gly 


Thr 


Val 


Asn 


Met 








160 


Gly 


Tyr 


Ser 


Val 






175 




Asp 


Phe 


Tyr 


His 




190 






Leu 


Val 


Tyr 


Gly 


205 






Gly 


Ala 


Leu 


Thr 


Asp 


Val 


Leu 


Leu 








240 


Thr 


Glu 


Val 


Ser 






255 




Gin 


Pro 


Thr 


Leu 




270 






Ser 


Thr 


Lys 


Thr 


285 








Ala 


Asp 


Asp 


Leu 


Asp 


Ala 


Thr 


Val 








320 


Pro 


Ala 


Val 


Asp 






335 




Val 


Val 


Gly 


Gin 




350 






Leu 


Gin 


Arg 


Tyr 


365 








Asp 


Glu 


Leu 


Ser 


He 


Gin 


Arg 


Phe 








400 


Gly 


Ser 


Pro 


Gly 
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405 

Lys Tyr Val Ser Leu Lys Asp Thr 
420 

Glu Gly Glu Tyr Asp His Leu Pro 
435 440 
Ser He Glu Glu Ala Val Glu Lys 
450 455 



410 415 
He Arg Gly Phe Lys Gly He Met 
425 430 
Glu Gin Ala Phe Tyr Met Val Gly 
445 

Ala Lys Lys Leu 
460 



<210> 337 
<211> 287 
<212> PRT 

<213> Escherichia coli 



<400> 337 



Met 


Ala 


Gly 


Ala 


Lys 


Glu 


He Arg 


Ser Lys He Ala 


Ser 


Val 


Gin Asn 


1 








5 






10 






15 


Thr 


Gin 


Lys 


lie 


Thr 


Lys 


Ala Met 


Glu Met Val Ala 


Ala 


Ser 


Lys Met 








20 








25 




30 




Arg 


Lys 


Ser 


Gin 


Asp 


Arg 


Met Ala 


Ala Ser Arg Pro 


Tyr 


Ala 


Glu Thr 






35 








40 




45 






Met 


Arg 


Lys 


Val 


He 


Gly 


His Leu 


Ala His Gly Asn 


Leu 


Glu 


Tyr Lys 




50 










55 


60 








His 


Pro 


Tyr 


Leu 


Glu 


Asp 


Arg Asp 


Val Lys Arg Val 


Gly 


Tyr 


Leu Val 


65 










70 




75 






80 


Val 


Ser 


Thr 


Asp 


Arg 


Gly 


Leu Cys 


Gly Gly Leu Asn 


He 


Asn 


Leu Phe 










85 






90 






95 


Lys 


Lys 


Leu 


Leu 


Ala 


Glu 


Met Lys 


Thr Tip Thr Asp 


Lys 


Gly 


Val Gin 








100 








105 




110 




Cys 


Asp 


Leu 


Ala 


Met 


He 


Gly Ser 


Lys Gly Val Ser 


Phe 


Phe 


Asn Ser 






115 








120 




125 






Val 


Gly 


Gly 


Asn 


Val 


Val 


Ala Gin 


Val Thr Gly Met 


Gly 


Asp 


Asn Pro 




130 










135 


140 








Ser 


Leu 


Ser 


Glu 


Leu 


He 


Gly Pro 


Val Lys Val Met 


Leu 


Gin 


Ala Tyr 


145 










150 




155 






160 


Asp 


Glu 


Gly 


Arg 


Leu 


Asp 


Lys Leu 


Tyr He Val Ser 


Asn 


Lys 


Phe He 










165 






170 






175 


Asn 


Thr 


Met 


Ser 


Gin 


Val 


Pro Thr 


He Ser Gin Leu 


Leu 


Pro 


Leu Pro 








180 








185 




190 




Ala 


Ser 


Asp 


Asp 


Asp 


Asp 


Leu Lys 


His Lys Ser Trp 


Asp 


Tyr 


Leu Tyr 






195 








200 




205 






Glu 


Pro 


Asp 


Pro 


Lys 


Ala 


Leu Leu 


Asp Thr Leu Leu 


Arg 


Arg 


Tyr Val 




210 










215 


220 








Glu 


Ser 


Gin 


Val 


Tyr 


Gin 


Gly Val 


Val Glu Asn Leu 


Ala 


Ser 


Glu Gin 


225 










230 




235 






240 


Ala 


Ala 


Arg 


Met 


Val 


Ala 


Met Lys 


Ala Ala Thr Asp 


Asn 


Gly 


Gly Ser 










245 






250 






255 


Leu 


He 


Lys 


Glu 


Leu 


Gin 


Leu Val 


Tyr Asn Lys Ala 


Arg 


Gin 


Ala Ser 








260 








265 




270 




lie 


Thr 


Gin 


Glu 


Leu 


Thr 


Glu He 


Val Ser Gly Ala 


Ala 


Ala 


Val 






275 








280 




285 







<210> 338 
<211> 513 
<212> PRT 

<213> Escherichia coli 
<400> 338 

Met Gin Leu Asn Ser Thr Glu He Ser Glu Leu He Lys Gin Arg He 
15 10 15 
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Ala 


Gin 


Phe 


Asn 








20 


Ser 


Val 


Ser 


Asp 






35 




Gin 


Gly 


Glu 


Met 




50 






Asn 


Leu 


Glu 


Arg 


65 








Asp 


Leu 


Ala 


Glu 


Val 


Pro 


Val 


Gly 








100 


Ala 


Pro 


He 


Asp 






115 




Val 


Glu 


Ala 


He 




130 






Pro 


Val 


Gin 


Thr 


145 








Arg 


Gly 


Gin 


Arg 


Ala 


Leu 


Ala 


He 








180 


Cys 


lie 


Tyr 


Val 






195 




Val 


Arg 


Lys 


Leu 




210 






Val 


Ala 


Thr 


Ala 


225 








Ala 


Gly 


Cys 


Ala 


Leu 


He 


He 


Tyr 








260 


lie 


Ser 


Leu 


Leu 






275 




Asp 


Val 


Phe 


Tyr 




290 






Asn 


Ala 


Glu 


Tyr 


305 








Thr 


Gly 


Ser 


Leu 


Val 


Ser 


Ala 


Phe 








340 


He 


Phe 


Leu 


Glu 






355 




Asn 


Pro 


Gly 


He 




370 






He 


Met 


Lys 


Lys 


385 








Arg 


Glu 


Leu 


Ala 


Thr 


Arg 


Lys 


Gin 








420 


Gin 


Lys 


Gin 


Tyr 






435 




Phe 


Ala 


Ala 


Glu 




450 






Gly 


Ser 


Phe 


Glu 


465 








Pro 


Leu 


Met 


Gin 


Glu 


Gly 


Lys 


Leu 



Val 


Val 


Ser 


Glu 


Gly 


Val 


11 e 


Arg 








40 


He 


Ser 


Leu 


Pro 










Asp 


Ser 


Val 


Gly 




70 






Gly 


Met 


Lys 


Val 


85 








Arg 


Gly 


Leu 


Leu 


Gly 


Lys 


Gly 


Pro 








120 


Ala 


Pro 


Gly 


Val 






135 




Gly 


Tyr 


Lys 


Ala 




150 






Glu 


Leu 


He 


He 


165 








Asp 


Ala 


He 


He 


Ala 


He 


Gly 


Gin 








200 


Glu 


Glu 


His 


Gly 






215 




Ser 


Glu 


Ser 


Ala 




230 






Met 


Gly 


Glu 


Tyr 


245 








Asp 


Asp 


Leu 


Ser 


Leu 


Arg 


Arg 


Pro 








280 


Leu 


His 


Ser 


Arg 






295 




Val 


Glu 


Ala 


Phe 




310 






Thr 


Ala 


Leu 


Pro 


325 








Val 


Pro 


Thr 


Asn 


Thr 


Asn 


Leu 


Phe 








360 


Ser 


Val 


Ser 


Arg 






375 




Leu 


Ser 


Gly 


Gly 










Ala 


Phe 


Ser 


Gin 


405 








Leu 


Asp 


His 


Gly 


Ala 


Pro 


Met 


Ser 








AAfl 


Arg 


Gly 


Tyr 


Leu 






455 




Ala 


Ala 


Leu 


Leu 




470 






Glu 


He 


Asn 


Gin 


485 








Lys 


Gly 


He 


Leu 



Ala 


7-1- « _ 

His 


Asn 


Glu 


^3 








He 


His 


Gly 


Leu 


Gly 


Asn 


Arg 


Tyr 








60 


Ala 


Val 


Val 


Met 






75 




Lys 


Cys 


Thr 


Gly 




90 






Gly 


Arg 


Val 


Val 


105 








Leu 


Asp 


His 


Asp 


He 


Glu 


Arg 


Gin 








140 


Val 


Asp 


Ser 


Met 






155 




Gly 


Asp 


Arg 


Gin 




170 






Asn 


Gin 


Arg 


Asp 


185 








Lys 


Ala 


Ser 


Thr 


Ala 


Leu 


Ala 


Asn 








220 


Ala 


Leu 


Gin 


Tyr 






235 




Phe 


Arg 


Asp 


Arg 




250 






Lys 


Gin 


Ala 


Val 


265 








Pro 


Gly 


Arg 


Glu 


Leu 


Leu 


Glu 


Arg 








300 


Thr 


Lys 


Gly 


Glu 






315 




He 


He 


Glu 


Thr 




330 






Val 


He 


Ser 


He 


345 








Asn 


Ala 


Gly 


He 


Val 


Gly 


Gly 


Ala 








380 


lie 


Arg 


Thr 


Ala 










Phe 


Ala 


Ser 


Asp 




at n 






Gin 


Lys 


val 


Thx 


AO C 








Val 


Ala 


Gin 


Gin 


Ala 


Asp 


Val 


Glu 








460 


Ala 


Tyr 


Val 


Asp 






475 




Thr 


Gly 


Gly 


Tyr 




490 






Asp 


Ser 


Phe 


Lys 
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Gly Thr 


He 


Val 


30 






Ala Asp 


Cys 


Met 


45 






Ala lie 


Ala 


Leu 


Gly Pro 


Tyr 


TV -1 _ 

Ala 






80 


Arg He 


Leu 


Glu 




95 




Asn Thr 


Leu 


Gly 


■tin 
110 






Gly Phe 


Ser 


Ala 


125 






Ser Val 


Asp 


Gin 


He Pro 


He 


Gly 






160 


Thr Gly 


Lys 


Thr 




175 




Ser Gly 


He 


Lys 


190 






He Ser 


Asn 


Val 


205 






Thr He 


Val 


Val 


Leu Ala 


Pro 


Tyr 






240 


Gly Glu 


Asp 


Ala 




ore 

255 




Ala" Tyr 


Arg 


Gin 


270 






Ala Phe 


Pro 


Gly 


285 






Ala Ala 


Arg 


t r 1 

Val 


Val Lys 


Gly 


Lys 






320 


Gin Ala 


Gly 


Asp 




335 




Thr Asp 


Gly 


Gin 


o c n 
350 






Arg Pro 


TV 1 — 

Ala 


Val 


365 






Ala Gin 


Thr 


Lys 


Leu Ala 


Gin 


Tyr 






400 


Leu Asp 


Asp 


Ala 








Glu Leu 


Leu 


Lys 








Ser Leu 


vai 


Leu 


445 






Leu Ser 


Lys 


He 


Arg Asp 


His 


Ala 






480 


Asn Asp 


Glu 


lie 




495 




Ala Thr 


Gin 


Ser 
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500 505 

Trp 



<210> 339 
<211> 177 
<212> PRT 

<213> Escherichia coli 



<400> 339 



Met 


Ser 


Glu Phe 


He 


Thr Val Ala Arg 


1 






5 




Asp 


Phe 


Ala Val 


Glu 


His Gin Ser Val 






20 




25 


Ala 


Phe 


Ala Ala 


Glu 


Val Thr Lys Asn 






35 




40 


Ser 


Gly 


Ala Leu 


Ala 


Pro Glu Thr Leu 




50 






55 


Cys 


Gly 


Glu Gin 


Leu 


Asp Glu Asn Gly 


65 








70 


Ala 


Glu 


Asia Gly 


Arg 


Leu Asn Ala Leu 


lie 


His 


Leu Arg 


85 
Ala 


Val Ser Glu Ala 






100 




105 


Ser 


Ala 


Ala Ala 


Leu 


Ser Glu Gin Gin 






115 




120 


Met 


Glu 


Lys Arg 


Leu 


Ser Arg Lys Val 




130 






135 


Lys 


Ser 


Val Met 


Ala 


Gly Val He He 


145 








150 


Asp 


Gly 


Ser Val 


Arg 


Gly Arg Leu Glu 



165 



Ser 



<210> 340 
<211> 156 
<212> PRT 

<213> Escherichia coli 



<400> 340 








Met 


Asn 


Leu 


Asn 


Ala 


Thr He Leu Gly 


l 








5 




Phe 


Val 


Leu 


Phe 


Cys 


Met Lys Tyr Val 








20 




25 


He 


Glu 


Lys 


Arg 


Gin 


Lys Glu He Ala 






35 






40 


Arg 


Ala 


His 


Lys 


Asp 


Leu Asp Leu Ala 




50 








55 


Leu 


Lys 


Lys 


Ala 


Lys 


Ala Glu Ala Gin 


65 










70 


Lys 


Arg 


Arg 


Ser 


Gin 


He Leu Asp Glu 










85 




Glu 


Arg 


Thr 


Lys 


He 


Val Ala Gin Ala 








100 




105 


Arg 


Lys 


Arg 


Ala 


Arg 


Glu Glu Leu Arg 






115 






120 


Val 


Ala 


Gly 


Ala 


Glu 


Lys He He Glu 




130 








135 
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510 



Pro 


Tyr Ala Lys Ala 


Ala 


Phe 


10 




15 




Glu 


Arg Trp Gin Asp 


Met 


Leu 




30 






Glu 


Gin Met Ala Glu 


Leu 


Leu 




45 






Ala 


Glu Ser Phe He 


Ala 


Val 




60 






Gin 


Asn Leu He Arg 


Val 


Met 




75 




80 


Pro 


Asp Val Leu Glu 


Gin 


Phe 


90 




95 




Thr 


Ala Glu Val Asp 


Val 


He 




110 






Leu 


Ala Lys He Ser 


Ala 


Ala 




125 






Lys 


Leu Asn Cys Lys 


He Asp 




140 






Arg 


Ala Gly Asp Met 


Val 


He 




155 




160 


Arg 


Leu Ala Asp Val 


Leu Gin 


170 




175 




Gin 


Ala He Ala Phe 


Val 


Leu 


10 




15 




Trp 


Pro Pro Leu Met 


Ala 


Ala 




30 






Asp 


Gly Leu Ala Ser 


Ala 


Glu 




45 






Lys 


Ala Ser Ala Thr 


Asp Gin 




60 






Val 


He He Glu Gin 


Ala 


Asn 




75 




80 


Ala 


Lys Ala Glu Ala 


Glu 


Gin 


90 




95 




Gin 


Ala Glu He Glu 


Ala 


Glu 




110 






Lys 


Gin Val Ala He 


Leu 


Ala 




125 






Arg 


Ser Val Asp Glu 

1 An 


Ala Ala 



140 
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Asn Ser Asp lie Val Asp Lys Leu Val Ala Glu Leu 
145 " 150 155 



<210> 341 
<211> 79 
<212> PRT 

<213> Escherichia coli 
<400> 341 



Met Glu Asn Leu Asn Met Asp Leu 


Leu Tyr Met 


Ala 


Ala 


Ala 


Val 


Met 


1 5 


10 








15 




Met Gly Leu Ala Ala lie Gly Ala 


Ala He Gly 


He 


Gly 


lie 


Leu 


Gly 


20 


25 






30 






Gly Lys Phe Leu Glu Gly Ala Ala 


Arg Gin Pro 


Asp 


Leu 


He 


Pro 


Leu 


35 40 






45 








Leu Arg Thr Gin Phe Phe lie Val 


Met Gly Leu 


Val 


Asp 


Ala 


He 


Pro 


50 55 




60 










Met lie Ala Val Gly Leu Gly Leu 


Tyr Val Met 


Phe 


Ala 


Val 


Ala 





65 70 75 



<210> 342 
<211> 271 
<212> PRT 

<213> Escherichia coli 



<400> 342 



Met 


Ala 


Ser 


Glu 


Asn 


Met Thr Pro 


Gin Asp Tyr 


He 


Gly 


His 


His 


Leu 


1 








5 




10 








15 




Asn 


Asn 


Leu 


Gin 


Leu 


Asp Leu Arg 


Thr Phe Ser 


Leu 


Val 


Asp 


Pro 


Gin 








20 






25 






30 






Asn 


Pro 


Pro 


Ala 


Thr 


Phe Trp Thr 


He Asn He 


Asp 


Ser 


Met 


Phe 


Phe 






35 






40 






45 








Ser 


Val 


Val 


Leu 


Gly 


Leu Leu Phe 


Leu Val Leu 


Phe 


Arg 


Ser 


Val 


Ala 




50 








55 




60 










Lys 


Lys 


Ala 


Thr 


Ser 


Gly Val Pro 


Gly Lys Phe 


Gin 


Thr 


Ala 


He 


Glu 


65 










70 


75 










80 


Leu 


Val 


He 


Gly 


Phe 


Val Asn Gly 


Ser Val Lys 


Asp 


Met 


Tyr 


His 


Gly 










85 




90 








95 




Lys 


Ser 


Lys 


Leu 


He 


Ala Pro Leu 


Ala Leu Thr 


He 


Phe 


Val 


Trp 


Val 








100 






105 






110 






Phe 


Leu 


Met 


Asn 


Leu 


Met Asp Leu 


Leu Pro He 


Asp 


Leu 


Leu 


Pro 


Tyr 






115 






120 






125 








He 


Ala 


Glu 


His 


Val 


Leu Gly Leu 


Pro Ala Leu 


Arg 


Val 


Val 


Pro 


Ser 




130 








135 




140 










Ala 


Asp 


Val 


Asn 


Val 


Thr Leu Ser 


Met Ala Leu 


Gly 


Val 


Phe 


He 


Leu 


145 










150 


155 










160 


He 


Leu 


Phe 


Tyr 


Ser 


He Lys Met 


Lys Gly He 


Gly 


Gly 


Phe 


Thr 


Lys 










165 




170 








175 




Glu 


Leu 


Thr 


Leu 


Gin 


Pro Phe Asn 


His Trp Ala 


Phe 


He 


Pro 


Val 


Asn 








180 






185 






190 






Leu 


He 


Leu 


Glu 


Gly 


Val Ser Leu 


Leu Ser Lys 


Pro 


Val 


Ser 


Leu 


Gly 






195 






200 






205 








Leu 


Arg 


Leu 


Phe 


Gly 


Asn Met Tyr 


Ala Gly Glu 


Leu 


He 


Phe 


He 


Leu 




210 








215 




220 










He 


Ala 


Gly 


Leu 


Leu 


Pro Trp Trp 


Ser Gin Trp 


He 


Leu 


Asn 


Val 


Pro 


225 










230 


235 










240 


Trp 


Ala 


He 


Phe 


His 


He Leu He 


He Thr Leu 


Gin 


Ala 


Phe 


He 


Phe 










245 




250 








255 




Met 


Val 


Leu 


Thr 


He 


Val Tyr Leu 


Ser Met Ala 


Ser 


Glu 


Glu 


His 
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260 265 270 



<210> 343 
<211> 130 
<212> PRT 

<213> Escherichia coli 
<400> 343 

Met Lys Asn Val Met Ser Val Ser Leu Val Ser Arg Asn Val Ala Arg 

1 5 10 15 

Lys Leu Leu Leu Val Gin Leu Leu Val Val lie Ala Ser Gly Leu Leu 

20 25 30 

Phe Ser Leu Lys Asp Pro Phe Tip Gly Val Ser Ala He Ser Gly Gly 

35 40 45 

Leu Ala Val Phe Leu Pro Asn Val Leu Phe Met He Phe Ala Trp Arg 

50 55 60 

His Gin Ala His Thr Pro Ala Lys Gly Arg Val Ala Trp Thr Phe Ala 
65 70 75 80 

Phe Gly Glu Ala Phe Lys Val Leu Ala Met Leu Val Leu Leu Val Val 

85 90 95 

Ala Leu Ala Val Leu Lys Ala Val Phe Leu Pro Leu He Val Thr Trp 

100 105 110 

Val Leu Val Leu Val Val Gin He Leu Ala Pro Ala Val He Asn Asn 
115 120 125 

Lys Gly 
130 



<210> 344 
<211> 413 
<212> PRT 

<213> Escherichia coli 
<400> 344 

Met Gin Lys His Gly Asp Arg Tyr Val Trp He Asn Pro Pro Ala He 

15 10 15 

Pro Leu Ser Thr Glu Glu Met Asp Ser Val Phe Ala Leu Pro Tyr Lys 

20 25 30 

Arg Val Pro His Pro Ala Tyr Gly Asn Ala Arg He Pro Ala Tyr Glu 

35 40 45 

Met He Arg Phe Ser Val Asn He Met Arg Gly Cys Phe Gly Gly Cys 

50 55 60 

Ser Phe Cys Ser He Thr Glu His Glu Gly Arg He He Gin Ser Arg 
65 70 75 80 

Ser Glu Asp Ser He He Asn Glu He Glu Ala He Arg Asp Thr Val 

85 90 95 

Pro Gly Phe Thr Gly Val He Ser Asp Leu Gly Gly Pro Thr Ala Asn 

100 105 110 

Met Tyr Met Leu Arg Cys Lys Ser Pro Arg Ala Glu Gin Thr Cys Arg 

115 120 125 

Arg Leu Ser Cys Val Tyr Pro Asp He Cys Pro His Met Asp Thr Asn 

130 135 140 

His Glu Pro Thr He Asn Leu Tyr Arg Arg Ala Arg Asp Leu Lys Gly 
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<210> 359 
<211> 186 
<212> PRT 

<213> Escherichia coli 
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Lys 


Glu 


Leu Glu Cys 


He Lys 


Glu 


Thr 


He 


Ser 


Lys 


Tyr 




130 








135 






140 










Cys 


Ala 


Lys 


Phe 


Thr 


Arg Lys Glu 


Ala He 


Leu 


Thr 


Pro 


Leu 


Val 


His 


145 










150 




155 










160 


Phe 


Asn 


Lys 


Thr 


Ser 


He Thr Ser 


Asp Cys 


Trp 


Gin 


He 


Leu 


Phe 


Phe 










165 




170 










175 




Ser 


Pro 


Asp 


His 


Phe 


Asn Asn Asp 


Phe Tyr 















180 185 



<210> 360 
<211> 395 
<212> PRT 

<213> Escherichia coli 
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Glu 
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540 
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Lys Val Tyr Pro Met Val Pro Pro Gly Ala Ala Asn Thr Glu Met Val 
545 550 555 560 

Gly Glu 



<210> 364 
<211> 32 
<212> PRT 

<213> Escherichia coli 
<400> 364 

Met Thr Thr Ser Met Leu Asn Ala 

1 5 
Ala Ala Val Val Val Val Arg Val 
20 



Lys Leu Leu Pro Thr Ala Pro Ser 

10 15 
Val Val Val Val Gly Asn Ala Pro 
25 30 



<210> 365 
<211> 338 
<212> PRT 

<213> Escherichia coli 



<400> 365 



Met 


Phe 


Val 


He 


Trp 


Ser 


His Arg Thr Gly 


Phe He Met Ser His Gin 


1 








5 




10 


15 


Leu 


Thr 


Phe 


Ala 


Asp 


Ser 


Glu Phe Ser Ser 


Lys Arg Arg Gin Thr Arg 








20 






25 


30 


Lys 


Glu 


He 


Phe 


Leu 


Ser 


Arg Met Glu Gin 


He Leu Pro Trp Gin Asn 






ob 








An 




Met 


Val 


Glu 


Val 


He 


Glu 


Pro Phe Tyr Pro 


Lys Ala Gly Asn Gly Arg 




50 










55 


60 


Arg 


Pro 


Tyr 


Pro 


Leu 


Glu 


Thr Met Leu Arg 


He His Cys Met Gin His 


65 










70 




75 80 


Trp 


Tyr 


Asn 


Leu 


Ser 


Asp 


Gly Ala Met Glu 


Asp Ala Leu Tyr Glu He 










85 




90 


95 


Ala 


Ser 


Met 


Arg 


Leu 


Phe 


Ala Arg Leu Ser 


Leu Asp Ser Ala Leu Pro 








100 






105 


110 


Asp 


Arg 


Thr 


Thr 


He 


Met 


Asn Phe Arg His 


Leu Leu Glu Gin His Gin 






115 








120 


125 


Leu 


Ala 


Arg 


Gin 


Leu 


Phe 


Lys Thr He Asn 


Arg Trp Leu Ala Glu Ala 




130 










135 


140 


Gly 


Val 


Met 


Met 


Thr 


Gin 


Gly Thr Leu Val 


Asp Ala Thr He He Glu 


145 










150 




155 160 


Ala 


Pro 


Ser 


Ser 


Thr 


Lys 


Asn Lys Glu Gin 


Gin Arg Asp Pro Glu Met 










165 




170 


175 


His 


Gin 


Thr 


Lys 


Lys 


Gly 


Asn Gin Trp His 


Phe Gly Met Lys Ala His 








180 






185 


190 


He 


Gly 


Val 


Asp 


Ala 


Lys 


Ser Gly Leu Thr 


His Ser Leu Val Thr Thr 






195 








200 


205 


Ala 


Ala 


Asn 


Glu 


His 


Asp 


Leu Asn Gin Leu 


Gly Asn Leu Leu His Gly 




210 










215 


220 


Glu 


Glu 


Gin 


Phe 


Val 


Ser 


Ala Asp Ala Gly 


Tyr Gin Gly Ala Pro Gin 


225 










230 




235 240 


Arg 


Glu 


Glu 


Leu 


Ala 


Glu 


Val Asp Val Asp 


Trp Leu He Ala Glu Arg 










245 




250 


255 


Pro 


Gly 


Lys 


Val 


Arg 


Thr 


Leu Lys Gin His 


Pro Arg Lys Asn Lys Thr 








260 






265 


270 


Ala 


He 


Asn 


He 


Glu 


Tyr 


Met Lys Ala Ser 


He Arg Ala Arg Val Glu 






275 








280 


285 


His 


Pro 


Phe 


Arg 


He 


He 


Lys Arg Gin Phe 


Gly Phe Val Lys Ala Arg 
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290 295 300 

Tyr Lys Gly Leu Leu Lys Asn Asp Asn Gin Leu Ala Met Leu Phe Thr 
305 310 315 320 

Leu Ala Asn Leu Phe Arg Ala Asp Gin Met lie Arg Gin Trp Glu Arg 
325 330 335 

Ser His 



<210> 366 
<211> 157 
<212> PRT 

<213> Escherichia coli 
<400> 366 



Met 


Val 


Tyr He He 


He 


Val Ser His Gly 


His 


Glu Asp Tyr He 


Lys 


1 




5 




10 




15 




Lys 


Leu 


Leu Glu Asn 


Leu 


Asn Ala Asp Asp 


Glu 


His Tyr Lys He 


lie 






20 




25 




30 




Val 


Arg 


Asp Asn Lys 


Asp 


Ser Leu Leu Leu 


Lys 


Gin He Cys Gin 


His 






35 




40 




45 




Tyr 


Ala 


Gly Leu Asp 


Tyr 


He Ser Gly Gly 


Val 


Tyr Gly Phe Gly 


His 




50 






55 




60 




Asn 


Asn 


Asn lie Ala 


Val 


Ala Tyr Val Lys 


Glu 


Lys Tyr Arg Pro 


Ala 


65 






70 




75 




80 


Asp 


Asp 


Asp Tyr He 


Leu 


Phe Leu Asn Pro 


Asp 


He He Met Lys 


His 






85 




90 




95 




Asp 


Asp 


Leu Leu Thr 


Tyr 


He Lys Tyr Val 


Glu 


Ser Lys Arg Tyr 


Ala 






100 




105 




110 




Phe 


Ser 


Thr Leu Cys 


Leu 


Phe Arg Asp Glu 


Ala 


Lys Ser Leu His 


Asp 






115 




120 




125 




Tyr 


Ser 


Val Arg Lys 


Phe 


Pro Val Leu Ser 


Asp 


Phe He Val Ser 


Phe 




130 






135 




140 




Met 


Leu 


Gly He Lys 


Glu 


Gly Ala Asn Lys 


Ser 


Leu He 




145 






150 




155 







<210> 367 
<211> 372 
<212> PRT 

<213> Escherichia coli 



<400> 367 



Met 


Gly 


Lys 


Ser 


He 


Val 


Val Val Ser Ala 


Val 


Asn Phe Thr Thr 


Gly 


1 








5 




10 




15 




Gly 


Pro 


Phe 


Thr 


He 


Leu 


Lys Lys Phe Leu 


Ala 


Ala Thr Asn Asn 


Lys 








20 






25 




30 




Glu 


Asn 


Val 


Ser 


Phe 


He 


Ala Leu Val His 


Ser 


Ala Lys Glu Leu 


Lys 






35 








40 




45 




Glu 


Ser 


Tyr 


Pro 


Trp 


Val 


Lys Phe He Glu 


Phe 


Pro Glu Val Lys 


Gly 




50 










55 




60 




Ser 


Trp 


lieu 


Lys 


Arg 


Leu 


His Phe Glu Tyr 


Val 


Val Cys Lys Lys 


Leu 


65 










70 




75 




80 


Ser 


Lys 


Glu 


Leu 


Asn 


Ala 


Thr His Trp He 


Cys 


Leu His Asp He 


Thr 










85 




90 




95 




Ala 


Asn 


Val 


Val 


Thr 


Lys 


Lys Arg Tyr Val 


Tyr 


Cys His Asn Pro 


Ala 








100 






105 




110 




Pro 


Phe 


Tyr 


Lys 


Gly 


He 


Leu Phe Arg Glu 


He 


Leu Met Glu Pro 


Ser 






115 








120 




125 




Phe 


Phe 


Leu 


Phe 


Lys 


Met 


Leu Tyr Gly Leu 


He 


Tyr Lys He Asn 


He 




130 










135 




140 
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Lys 


Lys 


Asn 


Thr 


Ala 


Val 


Phe 


Val 


Gin 


Gin 


Phe 


Trp 


Met 


Lys 


Glu 


Lys 


145 










150 










155 










160 


Phe 


He 


Lys 


Lys 


Tyr 
165 


Ser 


He 


Asn 


Asn 


He 
170 


He 


Val 


Ser 


Arg 


Pro 
175 


Glu 


He 


Lys 


Leu 


Ser 
180 


Asp 


Lys 


Ser 


Gin 


Leu 
185 


Thr 


Asp 


Asp 


Asp 


Ser 
190 


Gin 


Phe 


Lys 


Asn 


Asn 
195 


Pro 


Ser 


Glu 


Leu 


Thr 
200 


He 


Phe 


Tyr 


Pro 


Ala 
205 


Val 


Pro 


Arg 


Val 


Phe 
210 


Lys 


Asn 


Tyr 


Glu 


Leu 
215 


He 


He 


Ser 


Ala 


Ala 
220 


Arg 


Lys 


Leu 


Lys 


Glu 


Gin 


Ser 


Asn 


He 


Lys 


Phe 


Leu 


Leu 


Thr 


He 


Ser 


Gly 


Thr 


Glu 


Asn 


225 










230 










235 










240 


Ala 


Tyr 


Ala 


Lys 


Tyr 
245 


He 


He 


Ser 


Leu 


Ala 
250 


Glu 


Gly 


Leu 


Asp 


Asn 

255. 


Val 


His 


Phe 


Leu 


Gly 


Tyr 


lieu Asp 


Lys 


Glu 


Lys 


He 


Asp 


His 


Cys 


Tyr 


Asn 








260 










265 










270 






lie 


Ser 


Asp 
275 


He 


Val 


Cys 


Phe 


Pro 
280 


Ser 


Arg 


Leu 


Glu 


Thr 
285 


Trp 


Gly 


Leu 


Pro 


Leu 


Ser 


Glu 


Ala 


Lys Glu 


Arg 


Gly 


Lys 


Trp 


Val 


Leu 


Ala 


Ser 


Asp 




290 










295 










300 










Phe 


Pro 


Phe 


Thr 


Arg 


Glu 


Thr 


Leu 


Gly 


Ser 


Tyr 


Glu 


Lys 


Lys 


Ala 


Phe 


305 










310 










315 










320 


Phe 


Asp 


Ser 


Asn 


Asn 


Asp Asp 


Met 


Leu 


Val 


Lys 


Leu 


He 


He 


Asp 


Phe 










325 










330 










335 




Lys 


Lys 


Gly 


Asn 


Leu 


Lys Lys 


Asp 


He 


Ser 


Asp 


Ala 


Asn 


Phe 


He 


Tyr 








340 










345 










350 






Arg 


Asn 


Glu 


Asn 


Val 


Leu val 


Gly 


Phe 


Asp 


Glu 


Leu 


Val 


Asn 


Phe 


He 






355 










360 










365 








Thr 


Glu 


Glu 


His 



























370 



<210> 368 
<211> 196 
<212> PRT 

<213> Escherichia coli 



<400> 368 



Met 


He 


Leu 


Lys 


Leu 


Ala Lys 


Arg 


Tyr 


Gly 


Leu 


Cys 


Gly 


Phe 


He 


Arg 


1 








5 








10 










15 




Leu 


Val 


Arg 


Asp 


Val 


Leu Leu 


Thr 


Arg 


Val 


Phe 


Tyr 


Arg 


Asn 


Cys 


Arg 








20 








25 










30 






He 


lie 


Arg 


Phe 


Pro 


Cys Tyr 


He 


Arg 


Asn 


Asp 


Gly 


Ser 


He 


Asn 


Phe 






35 








40 










45 








Gly 


Glu 


Asn 


Phe 


Thr 


Ser Gly 


val 


Gly 


Leu 


Arg 


Leu 


Asp 


Ala 


Phe 


Gly 




50 








55 










60 










Arg 


Gly 


Val 


He 


Phe 


Phe Ser 


Asp 


Asn 


Val 


Gin 


Val 


Asn 


Asp 


Tyr 


Val 


65 










70 








75 










80 


His 


He 


Ala 


Ser 


He 


Glu Ser 


Val 


Thr 


He 


Gly 


Arg 


Asp 


Thr 


Leu 


He 










85 








90 










95 




Ala 


Ser 


Lys 


Val 


Phe 


He Thr 


Asp 


His 


Asn 


His 


Gly 


Ser 


Phe 


Lys 


His 








100 








105 










110 






Ser 


Asp 


Pro 


Met 


Ser 


Ser Pro 


Asn 


He 


Pro 


Pro 


Asp 


Met 


Arg 


Thr 


Leu 






115 








120 










125 








Glu 


Ser 


Ser 


Ala 


Val 


Val He 


Gly 


Gin 


Arg 


Val 


Trp 


Leu 


Gly 


Glu 


Asn 




130 








135 










140 










Val 


Thr 


Val 


Leu 


Pro 


Gly Thr 


He 


He 


Gly 


Asn 


Gly 


Val 


Val 


Val 


Gly 


145 










150 








155 










160 


Ala 


Asn 


Ser 


Val 


Val 


Arg Gly 


Ser 


He 


Pro 


Glu 


Asn 


Thr 


Val 


He 


Ala 










165 








170 










175 




Gly 


Val 


Pro 


Ala 


Lys 


He He 


Lys 


Lys 


Tyr 


Asn 


His 


Glu 


Thr 


Lys 


Leu 
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180 

Trp Glu Lys Ala 
195 



185 



190 



<210> 369 
<211> 330 
<212> PRT 

<213> Escherichia coli 



<400> 369 



Met 


Tyr 


Phe 


Leu Asn 


Asp Leu Asn 


Phe Ser Arg 


Arg Asp Ala 


Gly Phe 


1 






5 




10 




15 


Lys 


Ala 


Arg 


Lys Asp 


Ala Leu Asp 


He Ala Ser 


Asp Tyr Glu 


Asn He 








20 




25 


30 




Ser 


Val 


Val 


Asn He 


Pro Leu Trp 


Gly Gly Val 


Val Gin Arg 


He He 






35 




40 




45 




Ser 


Ser 


Val 


Lys Leu 


Ser Thr Phe 


Leu Cys Gly 


Leu Glu Asn 


Lys Asp 




50 






55 




60 




Val 


Leu 


He 


Phe Asn 


Phe Pro Met 


Ala Lys. Pro 


Phe Trp His 


He Leu 


65 








70 


75 




80 


Ser 


Phe 


Phe 


His Arg 


Leu Leu Lys 


Phe Arg He 


Val Pro Leu 


He His 








85 




90 




95 


Asp 


lie 


Asp 


Glu Leu 


Arg Gly Gly 


Gly Gly Ser 


Asp Ser Val 


Arg Leu 








100 




105 


110 




Ala 


Thr 


Cys 


Asp Met 


Val He Ser 


His Asn Pro 


Gin Met Thr 


Lys Tyr 






115 




120 




125 




Leu 


Ser 


Lys 


Tyr Met 


Ser Gin Asp 


Lys He Lys 


Asp He Lys 


He Phe 




130 






135 




140 




Asp 


Tyr 


Leu 


Val Ser 


Ser Asp Val 


Glu His Arg 


Asp Val Thr 


Asp Lys 


145 








150 


155 


/ 


160 


Gin 


Arg 


Gly 


Val He 


Tyr Ala Gly 


Asn Leu Ser 


Arg His Lys 


Cys Ser 








165 




170 




175 


Phe 


lie 


Tyr 


Thr Glu 


Gly Cys Asp 


Phe Thr Leu 


Phe Gly Val 


Asn Tyr 








180 




185 


190 




Glu 


Asn 


Lys 


Asp Asn 


Pro Lys Tyr 


Leu Gly Ser 


Phe Asp Ala 


Gin Ser 






195 




200 




205 




Pro 


Glu 


Lys 


He Asn 


Leu Pro Gly 


Met Gin Phe 


Gly Leu He 


Trp Asp 




210 






215 




220 




Gly 


Asp 


Ser 


Val Glu 


Thr Cys Ser 


Gly Ala Phe 


Gly Asp Tyr 


Leu Lys 


225 








230 


235 




240 


Phe 


Asn 


Asn 


Pro His 


Lys Thr Ser 


Leu Tyr Leu 


Ser Met Glu 


Leu Pro 








245 




250 




255 


Val 


Phe 


He 


Trp Asp 


Lys Ala Ala 


Leu Ala Asp 


Phe He Val 


Asp Asn 








260 




265 


270 




Arg 


He 


Gly 


Tyr Ala 


Val Gly Ser 


He Lys Glu 


Met Gin Glu 


He Val 






275 




280 




285 




Asp 


Ser 


Met 


Thr He 


Glu Thr Tyr 


Lys Gin He 


Ser Glu Asn 


Thr Lys 




290 






295 




300 




lie 


lie 


Ser 


Gin Lys 


He Arg Thr 


Gly Ser Tyr 


Phe Arg Asp 


Val Leu 


305 








310 


315 




320 


Glu 


Glu 


Val 


He Asp 


Asp Leu Lys 


Thr Arg 












325 




330 







<210> 370 
<211> 388 
<212> PRT 

<213> Escherichia coli 
<400> 370 
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<210> 371 
<211> 367 
<212> PRT 

<213> Escherichia coli 
<400> 371 

Met Tyr Asp Tyr He He Val Gly Ser Gly Leu Phe Gly Ala Val Cys 

1 5 10 15 

Ala Asn Glu Leu Lys Lys Leu Asn Lys Lys Val Leu Val He Glu Lys 
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Lys 


Ser 


Thr 


Leu 


Leu 


Gly 


Glu 


65 










70 






Leu 


Pro 


Phe 


Leu 


Phe 
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10 
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<210> 375 
<211> 291 
<212> PRT 

<213> Escherichia coli 
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65 70 75 80 

Pro Thr Ala Ser Asp Leu Arg Leu Val Met Val lie Ser Lys Thr lie 
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Ala Glu lieu Glu Arg lie Gly Asp Val Ala Asp Lys He Cys Arg Thr 
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Glu Ser Leu Gly Arg His Thr lie Gin Met Leu His Asp Val Leu Asp 
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Asp Lys Lys Val Asp Gin Glu Tyr Glu Gly He Val Arg Gin Leu Met 
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Thr Tyr Met Met Glu Asp Ser Arg Thr He Pro Ser Val Leu Thr Ala 

180 185 190 
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He 


Glu 


Phe 


Ser 


Asn 


Thr 


Asp 


Asp 


Leu 


Phe 



225 230 235 240 
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Thr Lys Pro Ala Lys Lys Gin Thr Glu Asp Tyr lie Thr Gly Arg Tyr 
245 250 255 

Gly 



<210> 378 
<211> 296 
<212> PRT 

<213> Escherichia coli 



<400> 378 



Met 


Ala 


Met Val 


Glu Met Gin 


Thr Thr Ala 


Ala 


Leu 


Ala 


Glu 


Ser 


Arg 


1 






5 


10 










15 




Arg 


Lys 


Met Gin 


Ala Arg Arg 


Arg Leu Lys 


Asn 


Arg 


He 


Ala 


Leu 


Thr 






20 




25 








30 






Leu 


Ser 


Met Ala 


Thr Met Ala 


Phe Gly Leu 


Phe 


Trp 


Leu 


lie 


Trp 


He 






35 




40 






45 








Leu 


Met 


Ser Thr 


He Thr Arg 


Gly He Asp 


Gly 


Met 


Ser 


Leu 


Ala 


Leu 




50 




55 






60 










Phe 


Thr 


Glu Met 


Thr Pro Pro 


Pro Asn Thr 


Glu 


Gly 


Gly 


Gly 


Leu 


Ala 


65 






70 




75 










80 


Asn 


Ala 


Leu Ala 


Gly Ser Gly 


Leu Leu He 


Leu 


Trp 


Ala 


Thr 


Val 


Phe 








85 


90 










95 




Gly 


Thr 


Pro Leu 


Gly He Met 


Ala Gly He 


Tyr 


Leu 


Ala 


Glu 


Tyr 


Gly 






100 




105 








110 






Arg 


Lys 


Ser Trp 


Leu Ala Glu 


Val He Arg 


Phe 


He 


Asn 


Asp 


He 


Leu 






115 




120 






125 








Leu 


Ser 


Ala Pro 


Ser He Val 


Val Gly Leu 


Phe 


Val 


Tyr 


Thr 


He 


Val 








1 "3C 
J.J9 
















Val 


Ala 


Gin Met 


Glu His Phe 


Ser Gly Trp 


Ala 


Gly 


Val 


He 


Ala 


Leu 


145 






150 




155 










160 


Ala 


Leu 


Leu Gin 


Val Pro He 


Val He Arg 


Thr 


Thr 


Glu 


Asn 


Met 


Leu 








165 


170 










175 




Lys 


Leu 


Val Pro 


Tyr Ser Leu 


Arg Glu Ala 


Ala 


Tyr 


Ala 


Leu 


Gly 


Thr 






180 




185 








190 






Pro 


Lys 


Trp Lys 


Met He Ser 


Ala He Thr 


Leu 


Lys 


Ala 


Ser 


Val 


Ser 






195 




200 






205 








Gly 


He 


Met Thr 


Gly He Leu 


Leu Ala He 


Ala 


Arg 


He 


Ala 


Gly 


Glu 




210 




215 






220 










Thr 


Ala 


Pro Leu 


Leu Phe Thr 


Ala Leu Ser 


Asn 


Gin 


Phe 


Trp 


Ser 


Thr 


225 






230 




235 










240 


Asp 


Met 


Met Gin 


Pro He Ala 


Asn Leu Pro 


Val 


Thr 


He 


Phe 


Lys 


Phe 








245 


250 










255 




Ala 


Met 


Ser Pro 


Phe Ala Glu 


Trp Gin Gin 


Leu 


Ala 


Trp 


Ala 


Gly 


Val 






260 




265 








270 






Leu 


He 


He Thr 


Leu Cys Val 


Leu Leu Leu 


Asn 


He 


Leu 


Ala 


Arg 


Val 






275 




280 






285 








Val 


Phe 


Ala Lys 


Asn Lys His 


Gly 















290 295 



<210> 379 
<211> 319 
<212> PRT 

<213> Escherichia coli 
<400> 379 

Met Ala Ala Thr Lys Pro Ala Phe Asn Pro Pro Gly Lys Lys Gly Asp 

1 5 10 15 

He He Phe Ser Val Leu Val Lys Leu Ala Ala Leu He Val Leu Leu 
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20 



Met 


Leu 


Gly 


Gly He He 


Val Ser 






35 




40 


He 


Gin 


Lys 


Phe Gly Leu 


Ala Phe 




50 






55 


Pro 


Asn 


Asp 


He Tyr Gly 


Ala Leu 


65 






70 




Thr 


Ser 


Phe 


He Ala Leu 


Leu He 








85 




Ala 


Leu 


Phe 


Leu Thr Glu 


Leu Ala 








100 




Gly 


He 


Ala 


He Glu Leu 


Leu Ala 






115 




120 


Met 


Trp 


Gly 


Leu Phe He 


Phe Ala 




130 






135 


Glu 


Pro 


Val 


Gly Asn He 


Met Ser 


145 






150 




Phe 


Ser 


Gly 


Pro Ala Phe 


Gly He 








165 




Leu 


Ala 


He 


Met He He 


Pro Tyr 








180 




Phe 


Glu 


Gin 


Thr Pro Val 


Met Met 






195 




200 


Cys 


Thr 


Thr 


Trp Glu Val 


He Trp 




210 






215 


Asn 


Gly 


Val 


He Gly Gly 


He Met 


225 






230 




Glu 


Thr 


Met 


Ala Val Thr 


Phe He 








245 




Ser 


Ala 


Ser 


Leu Tyr Met 


Pro Gly 








260 




Asn 


Glu 


Phe 


Ala Glu Ala 


Glu Ser 






275 




280 


Glu 


Leu 


Gly 


Leu He Leu 


Phe Val 




290 






295 


Ser 


Lys 


Phe 


Met He Met 


Arg Leu 


305 






310 





<210> 380 
<211> 346 
<212> PRT 

<213> Escherichia coli 



<400> 380 



Met 


Lys 


Val 


Met 


Arg 


Thr 


Thr Val 


1 








5 






Ser 


Met 


Ser 


Ala 
20 


Phe 


Ser 


Val Phe 


Gly 


Ala 


Thr 
35 


Phe 


Pro 


Ala 


Pro Val 
40 


Gin 


Lys 
50 


Glu 


Thr 


Gly 


Asn 


Lys Val 
55 


Gly 


Gly 


Val 


Lys 


Gin 


He 


He Ala 


65 










70 




Asp 


Ala 


Pro 


Leu 


Ser 
85 


Asp 


Glu Lys 


Phe 


Pro 


Thr 


Val 
100 


He 


Gly 


Gly Val 


Leu 


Lys 


Ser 
115 


Gly 


Glu 


Leu 


Val Leu 
120 



25 30 



Leu He He 


Ser 


Ser 


Trp 


Pro 


Ser 






45 








Leu Trp Thr 


Lys 


Glu 


Trp 


Asp 


Ala 




60 










Val Pro He 


Tyr 


Gly 


Thr 


Leu 


Val 


75 










80 


Ala Val Pro 


Val 


Ser 


Phe 


Gly 


He 


90 








95 




Pro Gly Trp 


Leu 


Lys 


Arg 


Pro 


Leu 


105 






110 






Ala He Pro 


Ser 


He 


Val 


Tyr 


Gly 






125 








Pro Leu Phe 


Ala 


Val 


Tyr 


Phe 


Gin 




140 










Asn He Pro 


He 


Val 


Gly 


Ala 


Leu 


155 










160 


Gly He Leu 


Ala 


Ala 


Gly 


Val 


lie 


170 








175 




He Ala Ala 


Val 


Met 


Arg 


Asp 


Val 


185 






190 






Lys Glu Ser 


Ala 


Tyr 


Gly 


He 


Gly 






205 








Arg He Val 


Leu 


Pro 


Phe 


Thr 


Lys 




220 










Leu Gly Leu 


Gly 


Arg 


Ala 


Leu 


Gly 


235 










240 


He Gly Asn 


Thr 


Tyr 


Gin 


Leu 


Asp 


250 








255 




Asn Ser He 


Thr 


Ser 


Ala 


Leu 


Ala 


265 






270 






Gly Leu His 


Val 


Ala 


Ala 


Leu 


Met 






285 








He Thr Phe 


He 


Val 


Leu 


Ala 


Ala 




300 










Ala Lys Asn 


Glu 


Gly 


Ala 


Arg 




315 












Ala Thr Val 


Val 


Ala 


Ala 


Thr 


Leu 


10 








15 




Ala Glu Ala 


Ser 


Leu 


Thr 


Gly 


Ala 


25 






30 






Tyr Ala Lys 


Trp 


Ala 


Asp 


Thr 


Tyr 






45 








Asn Tyr Gin 


Gly 


He 


Gly 


Ser 


Ser 




60 










Asn Thr Val 


Asp 


Phe 


Gly 


Ala 


Ser 


75 










80 


Leu Ala Gin 


Glu 


Gly 


Leu 


Phe 


Gin 


90 








95 




Val Leu Ala 


Val 


Asn 


He 


Pro 


Gly 


105 






110 






Asp Gly Lys 


Thr 


Leu 


Gly 


Asp 


He 



125 
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Tyr 


Leu 


Gly 


Lys 


lie 


Lys Lys 


Trp 
















Asn 


Pro 


Gly 


Leu 


Lys 


Leu Pro 


Ser 


145 










*1 C A 




Ala 


Asp 


Gly 


Ser 


Gly 


Thr Ser 












ICC 






Val 


Asn 


Glu 


Glu 


Trp 


Lys Asn 


Asn 








180 








Trp 


Pro 


He 


Gly 


Leu 


Gly Gly 


Lys 






IOC 








•inn 


Val 


Gin 


Arg 


Leu 


Pro 


Gly Ala 


He 




210 








215 




Lys 


Gin 


Asn 


Asn 


Leu 


Ala Tyr 


Thr 


225 










230 




Pro 


Val 


Ser 


Pro 


Thr 


Glu Glu 


Asn 










245 






Asp 


Trp 


Ser 


Lys 


Thr 


Phe Ala 


Gin 








260 








Asp 


Ala 


Trp 


Pro 


He 


Thr Ser 


Thr 






Aid 








on a 


Gin 


Lys 


Lys 


Pro 


Glu 


Gin Gly 


Thr 




290 








295 




Ala 


Tyr 


Lys 


Thr 


Gly 


Ala Lys 


Gin 


305 










310 




Leu 


Pro 


Asp 


Ser 


Val 


^al Glu 


Gin 










325 






He 


Lys 


Asp 


Ser 


Ser 


Gly Lys 


Pro 



340 



ASp 


Asp 


G1U 


»i 9 
Aia 

140 


lie 


Aid 


Lys 


Leu 


r*l n 

bin 


Asn 


lie 
155 


Ala 


vol 


val 


Arg 


Arg 
160 


Val 


pne 
170 


Thr 


Ser 


Tyr 


Leu 


A 1 a 
175 


Lys 


vai 


Giy 


Thr 


Giy 


Cay 


Thr 


val 


Lys 


185 










190 






Gly 


Asn 


Asp 


r»l ir 

tjiy 


Tl a 

lie 


Ala 


TVl a 
iua 


•trie 


Gly 


Tyr 


vai 


VjIU 

220 


Tyr 


Ala 


xyr 


iiia 


Lys 


Leu 


lie 
235 


Ser 


TV 1 — 

Ala 


T\ 

ASp 


Gly 


Lys 
240 


Pne 


TV 1 — 

Ala 

250 


Asn 


Ala 


Ala 


Lys 


Gly 
255 


Aia 


Asp 


Leu 


Thr 


Asn 


Gin 


Lys 


Gly 


Glu 


265 










270 






Thr 


Phe 


He 


Leu 


He 
285 


His 


Lys 


Asp 


Glu 


Val 


Leu 


Lys 
300 


Phe 


Phe 


Asp 


Trp 


Ala 


Asn 


Asp 
315 


Leu 


Asp 


Tyr 


Ala 


Ser 
320 


Val 


Arg 
330 


Ala 


Ala 


Trp 


Lys 


Thr 
335 


Asn 



Leu Tyr 
345 



<210> 381 

<211> 236 

<212> PRT 

<213> Escherichia coli 



<400> 381 



Met 


Gly 


Ser 


Gly 


1 








Ser 


Phe 


Thr 


Val 








20 


Gin 


Cys 


Val 


Lys 






35 




Ala 


Ala 


Leu 


Asp 




50 






Ala 


Thr 


Ser 


Leu 


65 








Thr 


Leu 


Tyr 


Ala 


Leu 


Asp 


Arg 


Tyr 








100 


Phe 


Cys 


Glu 


Glu 






115 




Ala 


He 


Ala 


Asn 




130 






Met 


He 


Asn 


Asn 


145 








Asn 


Thr 


Leu 


Lys 


Phe 


Leu 


Cys 


Gin 








180 


Val 


Thr 


His 


Leu 



Leu 


Val Asn 


Gly 


5 






Thr 


Arg Tyr 


Asn 


Lys 


Ser Arg 


Gly 






40 


Lys 


Ala Met 


Lys 




55 




Ala 


Asp Leu 


Val 




70 




Glu 


Phe Thr 


Asn 


85 






He 


Asp Arg 


Phe 


Lys 


Ser Val 


Glu 






120 


Cys 


Phe Thr 


Ser 




135 




Cys 


Thr Thr 


Leu 




150 




Ser 


Arg His 


Ala 


165 






Arg 


Gin Ala 


Arg 


Ala 


Glu Phe 


Leu 



Gly 


Asp 


Tyr 


Phe 




10 






Gly 


He 


Met 


Ala 


25 








Arg 


Pro 


Lys 


Val 


Leu 


Phe 


Trp 


Gin 








60 


Glu 


Ala 


Thr 


Gly 






75 




Lys 


Glu 


Gly 


Leu 




90 






Ala 


Ala 


Lys 


His 


105 








Ser 


Ala 


Leu 


Ala 


Lys 


Asp 


Thr 


Pro 








140 


Ser 


Pro 


Asp 


Ser 






155 




Met 


Gin 


Glu 


Arg 




170 






Gly 


Glu 


He 


Pro 


185 








Asn 


Cys 


He 


He 



Tyr 


Asn 


Asn 


Leu 






15 




Thr 


Asp 


Ser 


Thr 




30 






Phe 


Asp 


Arg 


Asp 


45 








His 


Gly 


Tyr 


Glu 


Ala 


Lys 


Ala 


Pro 








80 


Phe 


Arg 


Ala 


Val 






95 




Glu 


Ala 


Gin 


Leu 




110 






Asp 


Tyr 


Phe 


Ala 


125 








Ala 


Gly 


Cys 


Phe 


Gly 


Asp 


He 


Ala 








160 


Thr 


Leu 


Gin 


Gin 






175 




Pro 


His 


Cys 


Asp 




190 






Gin 


Gly 


Met 


Ser 
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195 200 205 

He Ser Ala Arg Glu Gly Ala Ser Leu Glu Lys Leu Met Gin He Ala 

210 215 220 

Gly Thr Thr Leu Arg Leu Trp Pro Glu Leu Val Lys 
225 230 235 



<210> 382 
<211> 181 
<212> PRT 

<213> Escherichia coli 



<400> 382 














Met 


Gin 


Ala 


Lys 


He 


Ala Ala 


Ser Asn Thr Gly Glu 


Leu Asp Ala 


Leu 


1 








5 




10 


15 




Gin 


Gin 


Leu 


Gly 


Phe 


Ser Leu 


Val Glu Gly Glu Val 


Asp Leu Ala 


Leu 








20 






25 


30 




Pro 


Val 


Asn 


Asn 


Ala 


Ser Asp 


Ser Gly Ala Val Val 


Ala Gin Glu 


Thr 






35 








40 


45 




Asp 


He 


Pro 


Ala 


Leu 


Arg Gin 


Leu Ala Ser Ala Ala 


Phe Ala Gin 


Ser 




50 








55 


60 






Arg 


Phe 


Arg 


Ala 


Pro 


Trp Tyr 


Ala Pro Asp Ala Ser 


Ser Arg Phe 


Tyr 


65 










70 


75 




80 


Ala 


Gin 


Trp 


He 


Glu 


Asn Ala 


Val Arg Gly Thr Phe 


Asp His Gin 


Cys 










85 




90 


95 




Leu 


He 


Leu 


Arg 


Ala 


Ala Ser 


Gly Asp He Arg Gly 


Tyr Val Ser 


Leu 








100 






105 


110 




Arg 


Glu 


Leu 


Asn 


Ala 


Thr Asp 


Ala Arg He Gly Leu 


Leu Ala Gly 


Arg 






115 








120 


125 




Gly 


Ala 


Gly 


Ala 


Glu 


Leu Met 


Gin Thr Ala Leu Asn 


Trp Ala Tyr 


Arg 




130 








135 


140 






Arg 


Gly 


Lys 


Thr 


Thr 


Leu Arg 


Val Ala Thr Gin Met 


Gly Asn Thr 


Ala 


145 










150 


155 




160 


Ala 


Leu 


Lys 


Arg 


Tyr 


He Gin 


Ser Gly Ala Asn Val 


Glu Ser Thr 


Ala 










165 




170 


175 




Tyr 


Trp 


Leu 


Tyr 


Arg 
















180 













<210> 383 
<211> 376 
<212> PRT 

<213> Escherichia coli 



<400> 383 



Met 


He 


Pro 


Phe 


Asn 


Ala Pro 


Pro Val Val Gly Thr 


Glu Leu Asp 


Tyr 


1 








5 




10 


15 




Met 


Gin 


Ser 


Ala 


Met 


Gly Ser 


Gly Lys Leu Cys Gly 


Asp Gly Gly 


Phe 








20 






25 


30 




Thr 


Arg 


Arg 


Cys 


Gin 


Gin Trp 


Leu Glu Gin Arg Phe 


Gly Ser Ala 


Lys 






35 








40 


45 




Val 


Leu 


Leu 


Thr 


Pro 


Ser Cys 


Thr Ala Ser Leu Glu 


Met Ala Ala 


Leu 




50 








55 


60 






Leu 


Leu 


Asp 


He 


Gin 


Pro Gly 


Asp Glu Val He Met 


Pro Ser Tyr 


Thr 


65 










70 


75 




80 


Phe 


Val 


Ser 


Thr 


Ala 


Asn Ala 


Phe Val Leu Arg Gly 


Ala Lys He 


Val 










85 




90 


95 




Phe 


Val 


Asp 


Val 


Arg 


Pro Asp 


Thr Met Asn He Asp 


Glu Thr Leu 


He 








100 






105 


110 




Glu 


Ala 


Ala 


He 


Thr 


Asp Lys 


Thr Arg Val He Val 


Pro Val His 


Tyr 






115 








120 


125 
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Ala 


Gly 


Val Ala Cys Glu 


Met 


Asp 




130 




135 




His 


Asn 


Leu Phe Val Val 


Glu 


Asp 


145 




150 






Tyr 


Lys 


Gly Arg Ala Leu 


Gly 


Thr 






165 






Phe 


His 


Glu Thr Lys Asn 


Tyr 


Thr 






180 






Leu 


lie 


Asn Asp Lys Ala 


Leu 


He 






195 




200 


Lys 


Gly 


Thr Asn Arg Ser 


Gin 


Phe 




210 




215 




Thr 


Trp 


Arg Asp lie Gly 


Ser 


Ser 


225 




230 






Ala 


Tyr 


Leu Trp Ala Gin 


Leu 


Glu 






245 






Arg 


Leu 


Ala Leu Trp Gin 


Asn 


Tyr 






260 






Lys 


Ala 


Gly Arg lie Glu 


Leu 


Pro 






275 




280 


Asn 


Ala 


His Met Phe Tyr 


He 


Lys 




290 




295 




Ala 


Leu 


He Asn Phe Leu 


Lys 


Glu 


305 




310 






Tyr 


lie 


Pro Leu His Gly 


Cys 


Pro 






325 






His 


Gly 


Glu Asp Arg Tyr 


Thr 


Thr 






340 






Leu 


Pro 


Leu Phe Tyr Asn 


Leu 


Ser 






355 




360 


Ala 


Thr 


Leu Leu Asn Tyr 


Phe 


Ser 




370 




375 





<210> 384 
<211> 416 
<212> PRT 

<213> Escherichia coli 



<400> 384 



Met 


Ser 


Leu 


Ala 


Lys 


Ala 


Ser 


Leu 


1 








5 








Lys 


He 


Gly 


Ala 


Gly 


Leu 


Leu 


Val 








20 










Gly 


Pro 


Ala 


Gly 


Leu 


Gly 


Leu 


Ala 






35 










40 


Val 


Leu 


Gly 


Val 


Leu 


Ala 


Gly 


Ala 




50 










55 




Tyr 


Val 


Ala 


Gin 


Tyr 


His 


Asp 


Asn 


65 










70 






Gly 


Thr 


Ser 


Ser 


Ala 


Met 


Val 


Leu 










85 








Val 


Phe 


Val 


Leu 


Ala 


Ala 


Ala 


Pro 








100 










Thr 


Asp 


Tyr 


Gin 


Gly 


Leu 


Val 


Arg 






115 










120 


He 


Ala 


Trp 


Gly 


Asn 


Leu 


Leu 


Leu 




130 










135 




Ala 


Ala 


Gly 


Asn 


Ala 


Leu 


Ser 


Leu 


145 










150 






Leu 


Ala 


Tyr 


Tyr 


Val 


Ser 


Tyr 


Arg 



Thr 


He 


Met 


Ala 


Leu 


Ala 


Lys 


Lys 








140 










Ala 


Ala 


Gin 


Gly 


Val 


Met 


Ser 


Thr 






155 










160 


lie 


Gly 


His 


He 


Gly 


Cys 


Phe 


Ser 




170 










175 




Ala 


Gly 


Gly 


Glu 


Gly 


Gly 


Ala 


Thr 


185 










190 






Glu 


Arg 


Ala 


Glu 


He 


He 


Arg 


Glu 










205 








Phe 


Arg 


Gly 


Gin 


Val 


Asp 


Lys 


Tyr 








220 










Tyr 


Leu 


Met 


Ser 


Asp 


Leu 


Gin 


Ala 






235 










240 


Ala 


Ala 


Asp 


Arg 


He 


Asn 


Gin 


Gin 




250 










255 




Tyr 


Asp 


Ala 


Leu 


Ala 


Pro 


Leu 


Ala 


265 










270 






Ser 


He 


Pro 


Asp 


Gly 


Cys 


Val 


Gin 










285 








Leu 


Arg 


Asp 


He 


Asp 


Asp 


Arg 


Ser 








300 










Ala 


Glu 


He 


Met 


Ala 


Val 


Phe 


His 






315 










320 


Ala 


Gly 


Glu 


His 


Phe 


Gly 


Glu 


Phe 




330 










335 




Lys 


Glu 


Ser 


Glu 


Arg 


Leu 


Leu 


Arg 


345 










350 






Pro 


Val 


Asn 


Gin 


Arg 


Thr 


Val 


He 



365 



Trp 


Thr 
10 


Ala 


Ala 


Ser 


Thr 


Leu 
15 


Val 


Gly 


Lys 


Leu 


Leu 


Ala 


Val 


Ser 


Phe 


25 










30 






Ala 


Asn 


Phe 


Arg 


Gin 
45 


Leu 


He 


Thr 


Gly 


He 


Phe 


Asn 
60 


Gly 


Val 


Thr 


Lys 


Pro 


Gin 


Gin 
.75 


Leu 


Arg 


Arg 


Val 


Val 
80 


Gly 


Phe 
90 


Ser 


Thr 


Leu 


Met 


Ala 
95 


Leu 


He 


Ser 


Gin 


Gly 


Leu 


Phe 


Gly 


Asn 


105 










110 






Leu 


Val 


Ala 


Leu 


Val 
125 


Gin 


Met 


Gly 


Ala 


Leu 


Met 


Lys 
140 


Gly 


Phe 


Arg 


Asp 


He 


Val 


Gly 
155 


Ser 


Leu 


He 


Gly 


Val 
160 


Leu 


Gly 


Gly 


Tyr 


Glu 


Gly 


Ala 


Leu 
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165 




170 








1 / D 




Leu Gly 


Leu 


Ala 


Leu 


lie 


Pro Ala Leu Val 


Val 


He 


Pro 


Ala Ala 


Tl- 








180 






loo 








i on 
iyu 




Met 


Leu 


lie 


Lys Arg 


Gly 


val lie Pro Leu 


Ser 


Tyr 


Leu 


Lys Pro 


Ser 






iyo 














one 

20o 
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530 535 540 

Asp Ala Ser lie Pro Pro Ala Thr Gin Ser His Gly 
545 550 555 
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Pro Asn Ser Val Thr Arg Gly Val lie Cys Trp Pro Gly Gly Gin Ser 

580 585 590 

Leu Pro Glu Gly Asp Gly Asn Cys Arg Arg Arg Leu Ala Thr Trp Leu 

595 600 605 

Leu Asp Gly Ser Gin Pro Pro Thr Leu Leu Leu Pro Glu Gin Glu Gly 

610 615 620 

lie Asn Gly He Arg Phe Pro He Trp Leu Asp Glu Asn Gly Lys Arg 
625 "* 630 635 640 

Val Ala Ala Asp Cys Pro Gin Ala Arg Gin Glu Met He Asn Val Trp 

645 650 655 

Pro Leu Pro Leu Glu Pro Trp Leu Pro Ala Ser Glu Arg Arg Ala Val 

660 665 670 

Arg Leu Pro Pro Ala. Ser Thr Ser Cys Pro Pro Tyr Gly His Asp Ala 

675 680 685 

Gin Leu Pro Leu Gin Leu Thr Gly Val Arg Asp Gly Ala He He Lys 

690 695 700 

Arg Leu Pro Gly Ala Ala Glu Ala Thr Leu Pro Leu Gin Ser Ser Gly 
705 " 710 715 720 

Gly Ala Gly Glu Arg Trp Trp Phe Leu Asn Gly Glu Pro Leu Thr Glu 

725 730 735 

Arg Gly Arg Asn Val Thr Leu His Leu Thr Asp Lys Gly Asp Tyr Gin 

740 • 745 750 

Leu Leu Val Met Asp Asp Val Gly Gin He Ala Thr Val Lys Phe Val 
755 760 765 

Met Gin 
770 



<210> 402 
<211> 1653 
<212> PRT 

<213> Escherichia coli 
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Lys Ala Asp Glu Thr Gly Lys Val Ser Phe Pro Val Glu Trp Gly Ala 
705 710 ~ 715 720 

Tyr Arg Leu Glu Val Lys Ala Pro Asn Glu Ala Val Ser Ser Val Arg 

725 730 735 

Phe Trp Ala Gly Tyr Ser Trp Gin Asp Asn Ser Asp Gly Ser Gly Ala 

740 745 750 

Val Arg Pro Asp Arg Val Thr Leu Lys Leu Asp Lys Ala Ser Tyr Arg 

755 760 765 

Pro Gly Asp Thr He Lys Leu His He Ala Ala Pro Thr Ala Gly Lys 

770 775 780 

Gly Tyr Ala Met Val Glu Ser Ser Glu Gly Pro Leu Trp Trp Gin Glu 
785 790 795 800 

He Asp Val Arg Ala Gin Gly Leu Asp Leu Thr He Pro Val Asp Lys 

805 810 815 

Thr Trp Asn Arg His Asp Leu Tyr Leu Ser Thr Leu Val Val Arg Pro 

820 825 830 

Gly Asp Lys Ser Arg Ser Ala Thr Pro Lys Arg Ala Val Gly Val Leu 

835 " 840 845 

His Leu Pro Leu Gly Asp Glu Asn Arg Arg Leu Asp Leu Ala Leu Glu 

850 855 860 

Thr Pro Ala Lys Met Arg Pro Asn Gin Pro Leu Thr Val Lys He Lys 
865 870 875 880 

Ala Ser Thr Lys Asn Gly Glu Lys Pro Lys Gin Val Asn Val Leu Val 

885 890 895 

Ser Ala Val Asp Ser Gly Val Leu Asn He Thr Asp Tyr Val Thr Pro 

900 905 910 

Asp Pro Trp Gin Ala Phe Phe Gly Gin Lys Arg Tyr Gly Ala Asp He 

915 920 925 

Tyr Asp He Tyr Gly Gin Val He Glu Gly Gin Gly Arg Leu Ala Ala 

930 " 935 940 

Leu Arg Phe Gly Gly Asp Gly Asp Glu Leu Lys Arg Gly Gly Lys Pro 
945 " 950 955 960 

Pro Val Asn His Val Asn He Val Val Gin Gin Ala Leu Pro Val Thr 

965 970 975 

Leu Asn Glu Gin Gly Glu Gly Ser Val Thr Leu Pro He Gly Asp Phe 

980 985 990 

Asn Gly Glu Leu Arg Val Met Ala Gin Ala Trp Thr Ala Asp Asp Phe 

995 1000 1005 

Gly Ser Asn Glu Ser Lys Val He Val Ala Ala Pro Val lie Ala Glu 

1010 1015 1020 

Leu Asn Met Pro Arg Phe Met Ala Ser Gly Asp Thr Ser Arg Leu Thr 
1025 1030 1035 1040 

Leu Asp He Thr Asn Leu Thr Asp Lys Pro Gin Lys Leu Asn Val Ala 

1045 1050 1055 

Leu Thr Ala Ser Gly Leu Leu Glu Leu Val Ser Asp Ser Pro Ala Ala 

1060 ~ 1065 1070 

Val Glu Leu Ala Pro Gly Val Arg Thr Thr Leu Phe He Pro Val Arg 

1075 1080 1085 

Ala Leu Pro Gly Tyr Gly Asp Gly Glu He Gin Ala Thr He Ser Gly 

1090 ~ 1095 1100 • 

Leu Ala Leu Pro Gly Glu Thr Val Ala Asp Gin His Lys Gin Trp Lys 
1105 HIO 1H5 1120 

He Gly Val Arg Pro Ala Phe Pro Ala Gin Thr Val Asn Tyr Gly Thr 

1125 1130 • 1135 

Ala Leu Gin Pro Gly Glu Thr Trp Ala He Pro Ala Asp Gly Leu Gin 

1140 1145 1150 

Asn Phe Ser Pro Val Thr Leu Glu Gly Gin Leu Leu Leu Ser Gly Lys 

1155 1160 1165 

Pro Pro Leu Asn He Ala Arg Tyr He Lys Glu Leu Lys Ala Tyr Pro 

1170 1175 1180 

Tyr Gly Cys Leu Glu Gin Thr Ala Ser Gly Leu Phe Pro Ser Leu Tyr 



-413- 



WO 01/48209 



PCT/USOO/34419 



1185 1190 1195 1200 

Thr Asn Ala Ala Gin Leu Gin Ala Leu Gly lie Lys Gly Asp Ser Asp 

1205 1210 1215 

Glu Lys Arg Arg Ala Ser Val Asp lie Gly lie Ser Arg Leu Leu Gin 

1220 1225 1230 

Met Gin Arg Asp Asn Gly Gly Phe Ala Leu Trp Asp Lys Asn Gly Asp 

1235 1240 1245 

Glu Glu Tyr Trp Leu Thr Ala Tyr Val Met Asp Phe Leu Val Arg Ala 

1250 1255 1260 

Gly Glu Gin Gly Tyr Ser Val Pro Thr Asp Ala lie Asn Arg Gly Asn 
1265 1270 1275 1280 

Glu Arg Leu Leu Arg Tyr Leu Gin Asp Pro Gly Met Met Ser lie Pro 

1285 1290 1295 

Tyr Ala Asp Asn Leu Lys Ala Ser Lys Phe Ala Val Gin Ser Tyr Ala 

1300 1305 1310 

Ala Leu Val Leu Ala Arg Gin Gin Lys Ala Pro Leu Gly Ala Leu Arg 

1315 1320 1325 

Glu lie Trp Glu His Arg Ala Asp Ala Ala Ser Gly Leu Pro Leu Leu 

1330 1335 1340 

Gin Leu Gly Val Ala Leu Lys Thr Met Gly Asp Ala Thr Arg Gly Glu 
1345 1350 1355 1360 

Glu Ala lie Ala Leu Ala Leu Lys Thr Pro Arg Asn Ser Asp Glu Arg 

1365 1370 1375 

lie Trp Leu Gly Asp Tyr Gly Ser Ser Leu Arg Asp Asn Ala Leu Met 

1380 ~ 1385 1390 

Leu Ser Leu Leu Glu Glu Asn Lys Leu Leu Pro Asp Glu Gin Tyr Thr 

1395 1400 1405 

Leu Leu Asn Thr Leu Ser Gin Gin Ala Phe Gly Glu Arg Trp Leu Ser 

1410 1415 1420 

Thr Gin Glu Ser Asn Ala Leu Phe Leu Ala Ala Arg Thr lie Gin Asp 
1425 1430 1435 1440 

Leu Pro Gly Lys Trp Gin Ala Gin Thr Ser Phe Ser Ala Glu Gin Leu 

1445 1450 1455 

Thr Gly Glu Lys Ala Gin Asn Ser Asn Leu Asn Ser Asp Gin Leu Val 

1460 1465 1470 

Thr Leu Gin Val Ser Asn Ser Gly Asp Gin Pro Leu Trp Leu Arg Met 

1475 1480 1485 

Asp Ala Ser Gly Tyr Pro Gin Ser Ala Pro Leu Pro Ala Asn Asn Val 

1490 " " 1495 1500 

Leu Gin lie Glu Arg His lie Leu Gly Thr Asp Gly Lys Ser Lys Ser 
1505 1510 1515 1520 

Leu Asp Ser Leu Arg Ser Gly Asp Leu Val Leu Val Trp Leu Gin Val 

1525 1530 1535 

Lys Ala Ser Asn Ser Val Pro Asp Ala Leu Val Val Asp Leu Leu Pro 

1540 1545 1550 

Ala Gly Leu Glu Leu Glu Asn Gin Asn Leu Ala Asn Gly Ser Ala Ser 

1555 1560 1565 

Leu Glu Gin Ser Gly Gly Glu Val Gin Asn Leu Leu Asn Gin Met Gin 

1570 ' 1575 1580 

Gin Ala Ser lie Lys His He Glu Phe Arg Asp Asp Arg Phe Val Ala 
1585 1590 1595 1600 

Ala Val Ala Val Asp Glu Tyr Gin Pro Val Thr Leu Val Tyr Leu Ala 

1605 1610 1615 

Arg Ala Val Thr Pro Gly Thr Tyr Gin Val Pro Gin Pro Met Val Glu 

1620 1625 1630 

Ser Met Tyr Val Pro Gin Trp Arg Ala Thr Gly Ala Ala Glu Asp Leu 

1635 1640 1645 

Leu He Val Arg Pro 
1650 
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<210> 403 
<211> 264 
<212> PRT 

<213> Escherichia coii 
<400> 403 

Met Asp Leu Leu Tyr Arg Val Lys Thr Leu Trp Ala Ala Leu Arg Gly 

1 5 10 15 

Asn His Tyr Thr Trp Pro Ala He Asp lie Thr Leu Pro Gly Asn Arg 

20 25 30 

His Phe His Leu lie Gly Ser He His Met Gly Ser His Asp Met Ala 

35 40 45 

Pro Leu Pro Thr Arg Leu Leu Lys Lys Leu Lys Asn Ala Asp Ala Leu 

50 55 60 

He Val Glu Ala Asp Val Ser Thr Ser Asp Thr Pro Phe Ala Asn Leu 
65 " 70 75 80 

Pro Ala Cys Glu Ala Leu Glu Glu Arg He Ser Glu Glu Gin Leu Gin 

85 90 95 

Asn Leu Gin His He Ser Gin Glu Met Gly He Ser Pro Ser Leu Phe 

100 105 HO 

Ser Thr Gin Pro Leu Trp Gin He Ala Met Val Leu Gin Ala Thr Gin 

115 120 125 

Ala Gin Lys Leu Gly Leu Arg Ala Glu Tyr Gly He Asp Tyr Gin Leu 

130 135 140 

Leu Gin Ala Ala Lys Gin Gin His Lys Pro Val He Glu Leu Glu Gly 
145 150 155 160 

Ala Glu Asn Gin He Ala Met Leu Leu Gin Leu Pro Asp Lys Gly Leu 

165 170 175 

Ala Leu Leu Asp Asp Thr Leu Thr His Trp His Thr Asn Ala Arg Leu 

180 185 190 

Leu Gin Gin Met Met Ser Trp Trp Leu Asn Ala Pro Pro Gin Asn Asn 

195 200 205 

Asp He Thr Leu Pro Asn Thr Phe Ser Gin Ser Leu Tyr Asp Val Leu 

210 215 220 

Met His Gin Arg Asn Leu Ala Trp Arg Asp Lys Leu Arg Ala Met Pro 
225 230 235 240 

Pro Gly Arg Tyr Val Val Ala Val Gly Ala Leu His Leu Tyr Gly Glu 

245 250 255 

Gly Asn Leu Pro Gin Met Leu Arg 
260 



<210> 404 
<211> 322 
<212> PRT 

<213> Escherichia coli 
<400> 404 

Met Asp Asn Phe Leu Ala Leu Thr Leu Thr Gly Lys Lys Pro Val He 

1 5 10 15 

Thr Glu Arg Glu He Asn Gly Val Arg Trp Arg Trp Leu Gly Asp Gly 

20 25 30 

Val Leu Glu Leu Thr Pro Leu Thr Pro Pro Gin Gly Ala Leu Val He 

35 40 45 

Ser Ala Gly He His Gly Asn Glu Thr Ala Pro Val Glu Met Leu Asp 

50 " 55 60 

Ala Leu Leu Gly Ala He Ser His Gly Glu He Pro Leu Arg Trp Arg 
65 70 75 80 

Leu Leu Val He Leu Gly Asn Pro Pro Ala Leu Lys Gin Gly Lys Arg 

85 90 95 

Tyr Cys His Ser Asp Met Asn Arg Met Phe Gly Gly Arg Trp Gin Leu 
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<210> 405 
<211> 447 
<212> PRT 

<213> Escherichia coli 
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Leu Phe Val Tyr Gly Arg Glu Glu Gly Asn Asp Thr Arg Pro Ser Arg 

195 200 205 

Tyr Pro Ala Arg Gin Thr Arg Glu Ala Ser Glu Ala Val Ala Arg Leu 

210 215 220 

Asn Gin Val Asn Pro Gin Gin Val He Phe Ala Gin Gin Asn Pro Asp 
225 230 235 240 

Val He Asp Gin Gly Val Phe His Asn Asp Val He Ala Val Ser Asn 

245 250 255 

Arg Gin Val Leu Phe Cys His Gin Gin Ala Phe Ala Arg Gin Ser Gin 

260 265 270 

Leu Leu Ala Asn Leu Arg Ala Arg Val Asn Gly Phe Met Ala He Glu 

275 280 285 

Val Pro Ala Thr Gin Val Ser Val Ser Asp Thr Val Ser Thr Tyr Leu 

290 295 300 

Phe Asn Ser Gin Leu Leu Ser Arg Asp Asp Gly Ser Met Met Leu Val 
305 310 315 320 

Leu Pro Gin Glu Cys Arg Glu His Ala Gly Val Trp Gly Tyr Leu Asn 

325 330 335 

Glu Leu Leu Ala Ala Asp Asn Pro He Ser Glu Leu Lys Val Phe Asp 

340 345 350 

Leu Arg Glu Ser Met Ala Asn Gly Gly Gly Pro Ala Cys Leu Arg Leu 

355 360 . 365 

Arg Val Val Leu Thr Glu Glu Glu Arg Arg Ala Val Asn Pro Ala Val 

370 375 380 

Met Met Asn Asp Thr Leu Phe Asn Ala Leu Asn Asp Trp Val Asp Arg 
385 " 390 395 400 

Tyr Tyr Arg Asp Arg Leu Thr Ala Ala Asp Leu Ala Asp Pro Gin Leu 

405 410 415 

Leu Arg Glu Gly Arg Glu Ala Leu Asp Val Leu Ser Gin Leu Leu Asn 

420 " 425 430 

Leu Gly Ser Val Tyr Pro Phe Gin Arg Glu Gly Gly Gly Asn Gly 
435 440 445 



<210> 406 
<211> 492 
<212> PRT 

<213> Escherichia coli 
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Thr Gly Thr Glu lie Tyr Gly Gin lie Ser Ala Met Val Phe Glu Gin 

180 185 190 

His Lys Ser Ala Arg Thr Glu Leu Glu Lys Leu Gin Ala Gin Ala Ser 

195 200 205 

Gly Val Thr Leu Leu Thr Pro Glu Gin Val Gin Ser Leu Thr Ala Ser 

210 215 220 

Leu Gin Val Leu Thr Asp Glu Glu Lys Gin Leu lie Thr Ala Gin Gin 
225 230 235 240 

Gin Glu Gin Gin Ser Leu Asn Trp Leu Thr Arg Gin Asp Glu Leu Gin 

245 250 255 

Gin Glu Ala Ser Arg Arg Gin Gin Ala Leu Gin Gin Ala Leu Ala Glu 

260 ~ ~ 265 270 

Glu Glu Lys Ala Gin Pro Gin Leu Ala Ala Leu Ser Leu Ala Gin Pro 

275 280 285 

Ala Arg Asn Leu Arg Pro His Trp Glu Arg lie Ala Glu His Ser Ala 

290 295 300 

Ala Leu Ala His lie Arg Gin Gin lie Glu Glu Val Asn Thr Arg Leu 
305 310 315 320 

Gin Ser Thr Met Ala Leu Arg Ala Ser lie Arg His His Ala Ala Lys 

325 330 335 

Gin Ser Ala Glu Leu Gin Gin Gin Gin Gin Ser Leu Asn Thr Trp Leu 

340 345 350 

Gin Glu His Asp Arg Phe Arg Gin Trp Asn Asn Glu Pro Ala Gly Trp 

355 360 365 

Arg Ala Gin Phe Ser Gin Gin Thr Ser Asp Arg Glu His Leu Arg Gin 

370 375 380 

Trp Gin Gin Gin Leu Thr His Ala Glu Gin Lys Leu Asn Ala Leu Ala 
385 390 395 400 

Ala He Thr Leu Thr Leu Thr Ala Asp Glu Val Ala Thr Ala Leu Ala 

405 410 415 

Gin His Ala Glu Gin Arg Pro Leu Arg Gin His Leu Val Ala Leu His 

420 425 430 

Gly Gin He Val Pro Gin Gin Lys Arg Leu Ala Gin Leu Gin Val Ala 

435 440 445 

He Gin Asn Val Thr Gin Glu Gin Thr Gin Arg Asn Ala Ala Leu Asn 

450 455 460 

Glu Met Arg Gin Arg Tyr Lys Glu Lys Thr Gin Gin Leu Ala Asp Val 
465 " 470 475 480 

Lys Thr He Cys Glu Gin Glu Ala Arg He Lys Thr Leu Glu Ala Gin 

485 490 495 

Arg Ma Gin Leu Gin Ala Gly Gin Pro Cys Pro Leu Cys Gly Ser Thr 

500 505 ' 510 

Ser His Pro Ala Val Glu Ala Tyr Gin Ala Leu Glu Pro Gly Val Asn 

515 520 525 

Gin Ser Arg Leu Leu Ala Leu Glu Asn Glu Val Lys Lys Leu Gly Glu 

530 " 535 540 

Glu Gly Ala Thr Leu Arg Gly Gin Leu Asp Ala He Thr Lys Gin Leu 
545 550 . 555 560 

Gin Arg Asp Glu Asn Glu Ala Gin Ser Leu Arg Gin Asp Glu Gin Ala 

565 570 575 

Leu Thr Gin Gin Trp Gin Ala Val Thr Ala Ser Leu Asn He Thr Leu 

580 585 590 

Gin Pro Leu Asp Asp He Gin Pro Trp Leu Asp Ala Gin Asp Glu His 

595 600 605 

Glu Arg Gin Leu Arg Leu Leu Ser Gin Arg His Glu Leu Gin Gly Gin 

610 615 620 

He Ala Ala His Asn Gin Gin He He Gin Tyr Gin Gin Gin He Glu 
625 630 635 640 

Gin Arg Gin Gin Leu Leu Leu Thr Thr Leu. Thr Gly Tyr Ala Leu Thr 

645 650 655 

Leu Pro Gin Glu Asp Glu Glu Glu Ser Trp Leu Ala Thr Arg Gin Gin 
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<210> 410 
<211> 400 
<212> PRT 

<213> Escherichia coli 
<400> 410 

Met Arg He Leu His Thr Ser Asp 

1 5 
Ser Lys Ser Arg Glu Ala Glu His 
20 



Trp His Leu Gly Gin Asn Phe Tyr 

10 15 
Gin Ala Phe Leu Asp Trp Leu Leu 
25 30 
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Gin Thr His Gin Val Asp Ala He He Val Ala Gly Asp 

40 45 
Thr Gly Ser Pro Pro Ser Tyr Ala Arg Thr Leu Tyr Asn 

55 60 
Val Asn Leu Gin Gin Thr Gly Cys His Leu Val Val Leu 

70 75 80 

His Asp Ser Val Ala Thr Leu Asn Glu Ser Arg Asp He 

85 90 95 

Leu Asn Thr Thr Val Val Ala Ser Ala Gly His Ala Pro 
100 105 110 

Pro Arg Arg Asp Gly Thr Pro Gly Ala Val Leu Cys Pro 

120 125 
Leu Arg Pro Arg Asp He He Thr Ser Gin Ala Gly Leu 

135 140 
Glu Lys Gin Gin His Leu Leu Ala Ala He Thr Asp Tyr 
150 155 160 

His Tyr Ala Asp Ala Cys Lys Leu Arg Gly Asp Gin Pro 

165 170 175 

He Ala Thr Gly His Leu Thr Thr Val Gly Ala Ser Lys 
180 * 185 . 190- 

Val Arg Asp He Tyr He Gly Thr Leu Asp Ala Phe Pro 

200 205 
Phe Pro Pro Ala Asp Tyr He Ala Leu Gly His He His 

215 220 
He He Gly Gly Met Glu His Val Arg Tyr Cys Gly Ser 
230 235 240 

Leu Ser Phe Asp Glu Cys Gly Lys Ser Lys Tyr Val His 

245 250 255 

Phe Ser Asn Gly Lys Leu Glu Ser Val Glu Asn Leu Asn 
260 265 270 

Thr Gin Pro Met Ala Val Leu Lys Gly Asp Leu Ala Ser 

280 285 
Gin Leu Glu Gin Trp Arg Asp Val Ser Gin Glu Pro Pro 

295 " 300 
Asp He Glu He Thr Thr Asp Glu Tyr Leu His Asp He 
310 315 320 

He Gin Ala Leu Thr Glu Ser Leu Pro Val Glu Val Leu 

325 330 335 

Arg Ser Arg Glu Gin Arg Glu Arg Val Leu Ala Ser Gin 
340 345 350 

Thr Leu Ser Glu Leu Ser Val Glu Glu Val Phe Asn Arg 

360 365 
Leu Glu Glu Leu Asp Glu Ser Gin Gin Gin Arg Leu Gin 

375 380 
Thr Thr Thr Leu His Thr Leu Ala Gly Glu His Glu Ala 
390 395 400 



<210> 411 
<211> 254 
<212> PRT 

<213> Escherichia coli 
<400> 411 

Met Asn Asp Leu Phe Ser Leu Ala 

1 5 
Ser Ala Gin Gly He Gly Phe Leu 
20 

Gly Ala Gin He He He Asn Asp 

35 40 
Ala Val Glu Lys Leu His Gin Glu 



Gly Lys Asn He Leu He Thr Gly 

10 15 
Leu Ala Thr Gly Leu Gly Lys Tyr 
25 30 
He Thr Ala Glu Arg Ala Glu Leu 
45 

Gly He Gin Ala Val Ala Ala Pro 
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<210> 412 
<211> 343 
<212> PRT 

<213> Escherichia coli 
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Ala Thr Asp Glu Leu He Ala Ala Val Thr Ala His Trp Arg Glu Asp 

180 185 190 

Phe Thr Val Leu Arg Leu His Gly Asp Cys His Ala Gly Asn He Leu 

195 200 205 

Trp Arg Asp Gly Pro Met Phe Val Asp Leu Asp Asp Ala Arg Asn Gly 

210 215 220 

Pro Ala Val Gin Asp Leu Trp Met Leu Leu Asn Gly Asp Lys Ala Glu 
225 230 235 240 

Gin Arg Met Gin Leu Glu Thr He He Glu Ala Tyr Glu Glu Phe Ser 

245 250 255 

Glu Phe Asp Thr Ala Glu He Gly Leu He Glu Pro Leu Arg Ala Met 

260 265 270 

Arg Leu Val Tyr Tyr Leu Ala Trp Leu Met Arg Arg Trp Ala Asp Pro 

275 280 285 

Ala Phe Pro Lys Asn Phe Pro Trp Leu Thr Gly Glu Asp Tyr Trp Leu 

290 295 300 

Arg Gin Thr Ala Thr Phe He Glu Gin Ala Lys Val Leu Gin Glu Pro 
305 310 315 320 

Pro Leu Gin Leu Thr Pro Met Tyr 
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<210> 419 

'<211> 208 

<212> PRT 

<213> Escherichia coli 
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<213> Escherichia coli 
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Leu Leu Thr Glu Arg Lys Leu Ser Leu Glu Pro Leu lie Ala His Arg 
305 310 315 320 

Gly Ser Phe Glu Ser Phe Ala Gin Ala Val Arg Asp lie Ala Arg Asn 

325 330 335 

Ala Met Pro Gly Lys Val Leu Leu lie Pro 
340 345 



<210> 422 
<211> 451 
<212> PRT 

<213> Escherichia coli 
<400> 422 

Met Phe Ser Glu Val Met Arg Tyr lie Leu Asp Leu Gly Pro Thr Val 

15 10 15 

Met Leu Pro lie Val He He He Phe Ser Lys He Leu Gly Met Lys 

20 25 30 

Ala Gly Asp Cys Phe Lys Ala Gly Leu His He Gly He Gly Phe Val 

35 40 45 

Gly He Gly Leu Val He Gly Leu Met Leu Asp Ser He Gly Pro Ala 

50 ^ 55 60 

Ala Lys Ala Met Ala Glu Asn Phe Asp Leu Asn Leu His Val Val Asp 
65 70 75 80 

Val Gly Trp Pro Gly Ser Ser Pro Met Thr Trp Ala Ser Gin He Ala 

85 90 95 

Leu Val Ala He Pro He Ala He Leu Val Asn Val Ala Met Leu Leu 

100 105 110 

Thr Arg Met Thr Arg Val Val Asn Val Asp He Trp Asn He Trp His 

115 " 120 125 

Met Thr Phe Thr Gly Ala Leu Leu His Leu Ala Thr Gly Ser Trp Met 

130 135 140 

He Gly Met Ala Gly Val Val He His Ala Ala Phe Val Tyr Lys Leu 
145 ~ 150 155 160 

Gly Asp Trp Phe Ala Arg Asp Thr Arg Asn Phe Phe Glu Leu Glu Gly 

165 170 175 

He Ala He Pro His Gly Thr Ser Ala Tyr Met Gly Pro He Ala Val 

180 185 190 

Leu Val Asp Ala He He Glu Lys He Pro Gly Val Asn Arg He Lys 

195 200 205 - 

Phe Ser Ala Asp Asp He Gin Arg Lys Phe Gly Pro Phe Gly Glu Pro 
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Val Thr Val Gly Phe Val Met Gly Leu He He Gly He Leu Ala Gly 
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Tyr Asp Val Lys Gly Val Leu Gin Leu Ala Val Lys Thr Ala Ala Val 
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Met Leu Leu Met Pro Arg Val He Lys Pro He Met Asp Gly Leu Thr 
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Gin Glu Phe Leu He Gly Leu Asp Pro Ala Leu Leu Leu Gly His Thr 
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Ala Val Val Ser Ala Ser Leu He Phe He Pro Leu Thr He Leu He 
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Ala Val Cys Val Pro Gly Asn Gin Val Leu Pro Phe Gly Asp Leu Ala 
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Thr lie Gly Phe Phe Val Ala Met Ala Val Ala Val His Arg Gly Asn 

340 345 350 

Leu Phe Arg Thr Leu He Ser Gly Val He He Met Ser He Thr Leu 
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Trp lie Ala Thr Gin Thr He Gly Leu His Thr Gin Leu Ala Ala Asn 
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<210> 425 
<211> 420 
<212> PRT 

<213> Escherichia coli 
<400> 425 

Met Lys Thr Leu He Ala Arg His Lys Ala Gly Glu His He Gly He 

1 5 10 15 

Cys Ser Val Cys Ser Ala His Pro Leu Val He Glu Ala Ala Leu Ala 

20 25 30 

Phe Asp Arg Asn Ser Thr Arg Lys Val Leu He Glu Ala Thr Ser Asn 

35 40 45 

Gin Val Asn Gin Phe Gly Gly Tyr Thr Gly Met Thr Pro Ala Asp Phe 

50 55 60 

Arg Glu Phe Val Phe Thr He Ala Asp Lys Val Gly Phe Ala Arg Glu 
65 70 75 80 

Arg He He Leu Gly Gly Asp His Leu Gly Pro Asn Cys Trp Gin Gin 

85 90 95 

Glu Asn Ala Asp Ala Ala Met Glu Lys Ser Val Glu Leu Val Lys Glu 

100 105 110 

Tyr Val Arg Ala Gly Phe Ser Lys He His Leu Asp Ala Ser Met Ser 

115 120 125 

Cys Ala Gly Asp Pro He Pro Leu Ala Pro Glu Thr Val Ala Glu Arg 

130 135 140 

Ala Ala Val Leu Cys Phe Ala Ala Glu Ser Val Ala Thr Asp Cys Gin 
145 150 155 160 

Arg Glu Gin Leu Ser Tyr Val He Gly Thr Glu Val Pro Val Pro Gly 

165 170 175 

Gly Glu Ala Ser Ala He Gin Ser Val His He Thr His Val Glu Asp 

180 185 190 

Ala Ala Asn Thr Leu Arg Thr His Gin Lys Ala Phe He Ala Arg Gly 

195 200 205 

Leu Thr Glu Ala Leu Thr Arg Val He Ala He Val Val Gin Pro Gly 

210 215 220 

Val Glu Phe Asp His Ser Asn He He His Tyr Gin Pro Gin Glu Ala 
225 230 235 240 

Gin Pro Leu Ala,. Gin Trp He Glu Asn Thr Arg Met Val Tyr Glu Ala 

245 250 255 

His Ser Thr Asp Tyr Gin Thr Arg Thr Ala Tyr Trp Glu Leu Val Arg 

260 * 265 270 

Asp His Phe Ala He Leu Lys Val Gly Pro Ala Leu Thr Phe Ala Leu 

275 280 285 

Arg Glu Ala He Phe Ala Leu Ala Gin He Glu Gin Glu Leu He Ala 

290 295 300 

Pro Glu Asn Arg Ser Gly Cys Leu Ala Val He Glu Glu Val Met Leu 
305 310 315 320 

Asp Glu Pro Gin Tyr Trp Lys Lys Tyr Tyr Arg Thr Gly Phe Asn Asp 

325 330 335 

Ser Leu Leu Asp He Arg Tyr Ser Leu Ser Asp Arg He Arg Tyr Tyr 

340 345 350 

Trp Pro His i Ser Arg He Lys Asn Ser Val Glu Thr Met Met Val Asn 

355 360 365 

Leu Glu Gly Val Asp He Pro Leu Gly Met He Ser Gin Tyr Leu Pro 

370 . 375 380 

Lys Gin Phe Glu Arg He Gin Ser Gly Glu Leu Ser Ala He Pro His 
385 390 395 400 

Gin Leu He Met Asp Lys He Tyr Asp Val Leu Arg Ala Tyr Arg Tyr 
405 410 415 

Gly Cys Ala Glu 
420 
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<210> 426 
<211> 286 
<212> PRT 

<213> Escherichia coli 
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<210> 427 
<211> 157 
<212> PRT 

<213> Escherichia coli 



<400> 427 
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Phe Ala Thr Ser Pro lie His Thr 
85 

Tyr Lys Lys Glu He He Ser Leu 
100 

Ala Asn His Leu Arg Ser He Leu 
115 120 
Thr Ala Glu Asp Arg Asn His Cys 

130 135 
Arg Glu Lys Asn Tyr Thr Pro Asn 
145 150 



Glu Asn He Leu He Thr Leu His 

90 95 
Leu Leu Glu Thr Thr Pro He Lys 
105 HO 
Asp Tyr He Glu Gin Glu Gin Leu 
125 

Met Lys Leu Ser Lys Lys He His 
140 

Ser Lys Ser Gin Trp 
155 



<210> 428 
<211> 471 
<212> PRT 

<213> Escherichia coli 
<400> 428 

Met Gin Ser Pro Ser Asp Ala He Phe Cys Arg His Leu Ser Leu Gin 

15 10 15 

Tyr Ala Leu Asp Ser Leu Arg Asn Gly Lys Gly Lys Val Asn Leu He 

20 25 30 

Lys His Tyr Ser Ser Val Glu Ser He Gin Gin His Val Pro Leu Val 

35 40 45 

Arg Asp Ala Glu Phe Arg Ala Leu Leu Arg His Pro Pro Ala Gly Ser 

50 55 60 

Arg Val He Ala Ser Lys Asp Phe Gly Phe Ala Leu Asp He Phe Phe 
65 70 75 80 

Cys Arg Met Met Ala Asn Asn Val Ser His Met Ser Ala He Leu Tyr 

85 90 95 

He Asp Asn His Thr Leu Ser Val Arg Leu Arg He Lys Gin Ser Val 

100 105 HO 

Tyr Gly Gin Leu Asn Tyr Val Val Ser Val Tyr Asp Pro Asn Asp Thr 

115 120 125 

Asn Val Ala Val Arg Asp Thr His Arg Thr Ala Arg Gly Phe Leu Ser 

130 135 140 

Leu Asp Lys Phe He Ser Ser Gly Pro Asp Ala Gin Thr Trp Ala Asp 
145 150 155 160 

Arg Tyr Val Arg Asn Cys Ala He Ala He Leu Pro Leu Leu Pro Val 

165 170 175 

Gly Val Pro Gly Ala He Phe Ala Gly He Ala Ser Arg Met Pro Phe 

180 185 190 

Ala Pro He His Pro Ser Ala Met Leu Leu He Met Ala Thr Gly Gin 

195 200 205 

Ser Gin Gin Leu He Thr Leu Phe Lys Gin Leu Pro He Leu Pro Glu 

210 215 220 

Lys Glu He He Glu He He Thr Ala Gin Asn Ser Val Gly Thr Pro 
225 230 235 240 

Ala Leu Phe Leu Ala Met Met Asn Gly His Thr Asp Asn Val Lys He 

245 250 255 

Phe Met Gin Glu He Gin Ser Leu Val Asp Asn His He He His Glu 

260 265 270 

Asp Asn Leu Val Lys Leu Leu Gin Thr Lys Ser Ala Asn Glu Thr Pro 

275 " 280 285 

Gly Leu Tyr He Ser Met Leu Tyr Gly Phe Asp Glu He He Asp He 

290 295 300 

Phe Leu Asn Ala Leu Thr Thr Pro He Ala Gin Glu Leu Leu Asn Lys 
305 310 315 320 

Lys Leu Val Met Ser He Leu Ala Met Lys He His Asp Gly Glu Pro 

325 330 335 

Gly Leu Tyr Ala Ala Met Glu Asn Asn His Pro Leu Cys Val Thr Arg 
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<210> 429 
<211> 128 
<212> PRT 

<213> Escherichia coli 
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<210> 430 
<211> 398 
<212> PRT 

<213> Escherichia coli 



<400> 430 
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Lys Gly Gly Ala Leu Met Asn Gly Leu Asn Tyr Ala Thr Cys Asp Gin 

100 105 110 

Val Phe Leu Ser Asp Ala Asp Thr Tyr Val Pro Pro Asp Gin Asp Gly 

115 120 125 

Met Gly Tyr Met Leu Ala Glu He Glu Arg Gly Ala Asp Ala Val Gly 

130 135 140 

Gly He Pro Ser Thr Ala Leu Lys Gly Ala Gly Leu Leu Pro His He 
145 150 155 160 

Arg Ala Thr Val Lys Leu Pro Met He Val Met Lys Arg Thr Leu Gin 

165 170 175 

Gin Leu Leu Gly Gly Ala Pro Phe He He Ser Gly Ala Cys Gly Met 

180 185 190 

Phe Arg Thr Asp Val Leu Arg Lys Phe Gly Phe Ser Asp Arg Thr Lys 

195 200 205 

Val Glu Asp Leu Asp Leu Thr Trp Thr Leu Val Ala Asn Gly Tyr Arg 

210 215 220 

He Arg Gin Ala Asn Arg Cys He Val Tyr Pro Gin Glu Cys Asn Ser 
225 230 235 240 

Pro Arg Glu Glu Trp Arg Arg Trp Arg Arg Trp He Val Gly Tyr Ala 

245 250 255 

Val Cys Met Arg Leu His Lys Arg Leu Leu Phe Ser Arg Phe Gly He 

260 265 270 

Phe Ser He Phe Pro Met Leu Leu Val Val Leu Tyr Gly Val Gly He 

275 280 285 

Tyr Leu Thr Thr Trp Phe Asn Glu Phe He Thr Thr Gly Pro His Gly 

290 * 295 300 

Val Val Leu Ala Met Phe Pro Leu He Trp Val Gly Val Val Cys Val 
305 310 315 320 

He Gly Ala Phe Ser Ala Trp Phe His Arg Cys Trp Leu Leu Val Pro 

325 330 335 

Leu Ala Pro Leu Ser Val Val Tyr Val Leu Leu Ala Tyr Ala He Trp 

340 345 350 

He He Tyr Gly Leu He Ala Phe Phe Thr Gly Arg Glu Pro Gin Arg 

355 ~ 360 365 

Asp Lys Pro Thr Arg Tyr Ser Ala Leu Val Glu Ala Ser Thr Ala Tyr 

370 375 380 

Ser Gin Pro Ser Val Thr Gly Thr Glu Lys Leu Ser Glu Ala 
385 390 395 



<210> 431 
<211> 552 
<212> PRT 

<213> Escherichia coli 



<400> 431 

Met He Leu Glu Arg Val Glu He Val Gly Phe Arg Gly He Asn Arg 

15 10 15 

Leu Ser Leu Met Leu Glu Gin Asn Asn Val Leu He Gly Glu Asn Ala 

20 25 30 

Trp Gly Lys Ser Ser Leu Leu Asp Ala Leu Thr Leu Leu Leu Ser Pro 

35 40 . 45 

Glu Ser Asp Leu Tyr His Phe Glu Arg Asp Asp Phe Trp Phe Pro Pro 

50 55 60 

Gly Asp He Asn Gly Arg Glu His His Leu His He He Leu Thr Phe 
65 70 75 80 

Arg Glu Ser Leu Pro Gly Arg His Arg Val Arg Arg Tyr Arg Pro Leu 

85 90 95 

Glu Ala Cys Trp Thr Pro Cys Thr Asp Gly Tyr His Arg He Phe Tyr 

100 105 HO 

Arg Leu Glu Gly Glu Ser Ala Glu Asp Gly Ser Val Met Thr Leu Arg 
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<210> 432 
<211> 352 
<212> PRT 

<213> Escherichia coli 
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<400> 432 

Met Leu Pro Ser He Ser He Asn Asn Thr Ser Ala Ala Tyr Pro Glu 

15 10 15 

Ser He Asn Glu Asn Asn Asn Asp Glu Val Asn Gly Leu Val Gin Glu 

20 25 30 

Phe Lys Asn Leu Phe Asn Gly Lys Glu Gly He Ser Thr Cys He Lys 

35 40 45 

His Leu Leu Glu Leu He Lys Asn Ala He Arg Val Asn Asp Asp Pro 

50 55 60 

Tyr Arg Phe Asn He Asn Asn Ser Ser Val Thr Tyr He Asp He Asp 
65 70 75 80 

Ser Asn Asp Thr Asp His He Thr He Gly He Asp Asn Gin Glu Pro 

85 90 95 

He Glu Leu Pro Ala Asn Tyr Lys Asp Lys Glu Leu Val Arg Thr He 

100 105 110 

He Asn Asp Asn He Val Glu Lys Thr His Asp He Asn Asn Lys Glu 

115 120 125 

Met He Phe Ser Ala Leu Lys Glu He Tyr Asp Gly Asp Pro Gly Phe 

130 135 .140 

He Phe Asp Lys He Ser His Lys Leu Arg His Thr Val Thr Glu Phe 
145 150 155 160 

Asp Glu Ser Gly Lys Ser Glu Pro Thr Asp Leu Phe Thr Trp Tyr Gly 

165 170 175 

Lys Asp Lys Lys Gly Asp Ser Leu Ala He Val He Lys Asn Lys Asn 

180 185 190 

Gly Asn Asp Tyr Leu Ser Leu Gly Tyr Tyr Asp Gin Asp Asp Tyr His 

195 200 205 

He Gin Arg Gly He Arg He Asn Gly Asp Ser Leu Thr Gin Tyr Cys 

210 215 220 

Ser Glu Asn Ala Arg Ser Ala Ser Ala Trp Phe Glu Ser Ser Lys Ala 
225 230 235 240 

He Met Ala Glu Ser Phe Ala Thr Gly Ser Asp His Gin Val Val Asn 

245 250 255 

Glu Leu Asn Gly Glu Arg Leu Arg Glu Pro Asn Asp Val Phe Lys Arg 

260 265 270 

Tyr Gly Arg Ala He Arg Tyr Asp Phe Gin Val Asp Asp Ala Lys Tyr 

275 " 280 285 

Lys Cys Asp His Leu Lys Glu He Val Ser Thr Leu Val Gly Asn Lys 

290 295 300 

He Asn Val Gly His Ser Gin Lys He Tyr Lys His Phe Lys Asp Leu 
305 310 315 320 

Glu Gly Lys He Glu Glu Arg Leu Gin Asn Arg Gin Ala Glu Tyr Gin 

325 330 335 

Asn Glu He Asn Gin Pro Ser Ala Pro Gly Val Asn Phe Asp Asp He 
340 345 350 



<210> 433 
<211> 375 
<212> PRT 

<213> Escherichia coli 
<400> 433 

Met Met Arg His Leu Arg Asn He Phe Asn Leu Gly He Lys Glu Leu 

15 10 15 

Arg Ser Leu Leu Gly Asp Lys Ala Met Leu Thr Leu He Val Phe Ser 

20 25 30 

Phe Thr Val Ser Val Tyr Ser Ser Ala Thr Val Thr Pro Gly Ser Leu 

35 40 45 

Asn Leu Ala Pro He Ala He Ala Asp Met Asp Gin Ser Gin Leu Ser 
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<210> 434 
<211> 894 
<212> PRT 

<213> Escherichia coli 
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Ala Arg Leu Phe Gly His Asp Lys Ala Glu Arg Glu Val Arg lie Asn 

100 105 110 

Glu Leu Leu Thr Ser Thr Gly Leu Ala Pro Phe Arg Asp Arg Pro Ala 

115 120 125 

Gly Lys Leu Ser Gly Gly Met Lys Gin Lys Leu Gly Leu Cys Cys Ala 

130 135 140 

Leu He His Asp Pro Glu Leu Leu He Leu Asp Glu Pro Thr Thr Gly 
145 150 155 160 

Val Asp Pro Leu Ser Arg Ser Gin Phe Trp Asp Leu He Asp Ser He 

165 170 175 

Arg Gin Arg Gin Ser Asn Met Ser Val Leu Val Ala Thr Ala Tyr Met 

180 185 190 

Glu Glu Ala Glu Arg Phe Asp Trp Leu Val Ala Met Asn Ala Gly Glu 

195 200 205 

Val Leu Ala Thr Gly Ser Ala Glu Glu Leu Arg Gin Gin Thr Gin Ser 

210 215 220 

Ala Thr Leu Glu Glu Ala Phe He Asn Leu Leu Pro Gin Ala Gin Arg 
225 230 235 240 

Gin Ala His Gin Ala Val Val He Pro Pro Tyr Gin Pro Glu Asn Ala 

245 250 . 255 

Glu He Ala He Glu Ala Arg Asp Leu Thr Met Arg Phe Gly Ser Phe 

260 265 270 

Val Ala Val Asp His Val Asn Phe Arg He Pro Arg Gly Glu He Phe 

275 280 285 

Gly Phe Leu Gly Ser Asn Gly Cys Gly Lys Ser Thr Thr Met Lys Met 

290 295 300 

Leu Thr Gly Leu Leu Pro Ala Ser Glu Gly Glu Ala Trp Leu Phe Gly 
305 ^ 310 315 320 

Gin Pro Val Asp Pro Lys Asp He Asp Thr Arg Arg Arg Val Gly Tyr 

325 330 335 

Met Ser Gin Ala Phe Ser Leu Tyr Asn Glu Leu Thr Val Arg Gin Asn 

340 345 350 

Leu Glu Leu His Ala Arg Leu Phe His He Pro Glu Ala Glu He Pro 

355 ~ 360 365 

Ala Arg Val Ala Glu Met Ser Glu Arg Phe Lys Leu Asn Asp Val Glu 

370 375 380 

Asp He Leu Pro Glu Ser Leu Pro Leu Gly He Arg Gin Arg Leu Ser 
385 390 395 400 

Leu Ala Val Ala Val He His Arg Pro Glu Met Leu He Leu Asp Glu 

405 410 415 

Pro Thr Ser Gly Val Asp Pro Val Aia Arg Asp Met Phe Trp Gin Leu 

420 425 430 

Met Val Asp Leu Ser Arg Gin Asp Lys Val Thr He Phe He Ser Thr 

435 440 445 

His Phe Met Asn Glu Ala Glu Arg Cys Asp Arg He Ser Leu Met His 

450 455 460 

Ala Gly Lys Val Leu Ala Ser Gly Thr Pro Gin Glu Leu Val Glu Lys 
465 470 475 480 

Arg Gly Ala Ala Ser Leu Glu Glu Ala Phe He Ala Tyr Leu Gin Glu 

485 - 490 495 

Ala Ala Gly Gin Ser Asn Glu Ala Glu Ala Pro Pro Val Val His Asp 

500 505 510 

Thr Thr His Ala Pro Arg Gin Gly Phe Ser Leu Arg Arg Leu Phe Ser 

515 - 520 . 525 

Tyr Ser Arg Arg Glu Ala Leu Glu Leu Arg Arg Asp Pro Val Arg Ser 

530 • 535 540 

Thr Leu Ala Leu Met Gly Thr Val He Leu Met Leu He Met Gly Tyr 
545 550 555 560 

Gly He Ser Met Asp Val Glu Asn Leu Arg Phe Ala Val Leu Asp Arg 

565 * 570 575 

Asp Gin Thr Val Ser Ser Gin Ala Trp Thr Leu Asn Leu Ser Gly Ser 
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<210> 435 
<211> 355 
<212> PRT 

<213> Escherichia coli 
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<210> 437 
<211> 480 
<212> PRT 

<213> Escherichia coli 
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Val Glu Glu Ala He Ala Met Ser 
365 

Lys Gly Val Thr Ala He He Thr 
380 

Leu Met Thr Ser Arg He Ser Ser 
395 400 
Arg His Glu Arg Thr Leu Asn Leu 

410 415 
Pro Val His Phe Asp Ser Ala Asn 
425 430 
Ala Val Asn Leu Leu Arg Asp Lys 
445 

Val lie Val Thr Gin Gly Asp Val 
460 

Thr Thr Arg lie Leu Thr Val Glu 
475 480 



<210> 438 
<211> 239 
<212> PRT 

<213> Escherichia coli 



<400> 438 

Met He Asn Val Leu He He Asp Asp Asp Ala Met Val Ala Glu Leu 

1 5 10 15 

Asn Arg Arg Tyr Val Ala Gin He Pro Gly Phe Gin Cys Cys Gly Thr 

20 25 30 

Ala Ser Thr Leu Glu Lys Ala Lys Glu He He Phe Asn Ser Asp Thr 

35 40 45 

Pro He Asp Leu He Leu Leu Asp He Tyr Met Gin Lys Glu Asn Gly 

50 " 55 60 

Leu Asp Leu Leu Pro Val Leu His Asn Ala Arg Cys Lys Ser Asp Val 
65 " 70 75 80 

He Val He Ser Ser Ala Ala Asp Ala Ala Thr He Lys Asp Ser Leu 

85 90 95 

His Tyr Gly Val Val Asp Tyr Leu He Lys Pro Phe Gin Ala Ser Arg 

100 105 HO 

Phe Glu Glu Ala Leu Thr Gly Trp Arg Gin Lys Lys Met Ala Leu Glu 

115 120 125. 

Lys His Gin Tyr Tyr Asp Gin Ala Glu Leu Asp Gin Leu He His Gly 

130 135 140 

Ser Ser Ser Asn Glu Gin Asp Pro Arg Arg Leu Pro Lys Gly Leu Thr 
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Pro Gin Thr Leu Arg Thr Leu Cys Gin Trp He Asp Ala His Gin Asp 

165 170 175 

Tyr Glu Phe Ser Thr Asp Glu Leu Ala Asn Glu Val Asn He Ser Arg 

180 185 190 

Val Ser Cys Arg Lys Tyr Leu He Trp Leu Val Asn Cys His He Leu 

195 200 205 

Phe Thr Ser He His Tyr Gly Val Thr Gly Arg Pro Val Tyr Arg Tyr 

210 215 .220 

Arg He Gin Ala Glu His Tyr Ser Leu Leu Lys Gin Tyr Cys Gin 
225 230 235 



<210> 439 
<211> 543 
<212> PRT 

<213> Escherichia coli 
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Ala Pro Asp Lys He Asp His He Phe Asp Lys Gly Val Ser Thr Lys 

485 490 495 

Gly Ser Glu Arg Gly Val Gly Leu Ala Leu Val Lys Gin Gin Val Glu 

500 505 510 

Asn Leu Gly Gly Ser He Ala Val Glu Ser Glu Pro Gly He Phe Thr 

515 520 525 

Gin Phe Phe Val Gin He Pro Trp Asp Gly Glu Arg Ser Asn Arg 
530 535 540 



<210> 440 
<211> 328 
<212> PRT 

<213> Escherichia coli 
<400> 440 

Met Ser Val Pro Leu Ser Thr Trp Asn Leu Leu Arg Tyr Asn Asn Ser 

15 10 15 

Tyr Leu Gin Lys Val Thr Met Phe Pro Gin Cys Lys Phe Ser Arg Glu 

20 25 30 

Phe Leu His Pro Arg Tyr Trp Leu Thr Trp Phe Gly Leu Gly Val Leu 

35 40 45 

Trp Leu Trp Val Gin Leu Pro Tyr Pro Val Leu Cys Phe Leu Gly Thr 

50 ^ 55 60 

Arg He Gly Ala Met Ala Arg Pro Phe Leu* Lys Arg Arg Glu Ser He 
65 70 75 80 

Ala Arg Lys Asn Leu Glu Leu Cys Phe Pro Gin His Ser Ala Glu Glu 

85 90 95 

Arg Glu Lys Met He Ala Glu Asn Phe Arg Ser Leu Gly Met Ala Leu 

100 '105 110 

Val Glu Thr Gly Met Ala Trp Phe Trp Pro Asp Ser Arg Val Arg Lys 

115 120 125 

Trp Phe Asp Val Glu Gly Leu Asp Asn Leu Lys Arg Ala Gin Met Gin 

130 135 140 

Asn Arg Gly Val Met Val Val Gly Val His Phe Met Ser Leu Glu Leu 
145 " 150 155 160 

Gly Gly Arg Val Met Gly Leu Cys Gin Pro Met Met Ala Thr Tyr Arg 

165 170 175 

Pro His Asn Asn Gin Leu Met Glu Trp Val Gin Thr Arg Gly Arg Met 

180 185 190 

Arg Ser Asn Lys Ala Met He Gly Arg Asn Asn Leu Arg Gly He Val 

195 200. 205 

Gly Ala Leu Lys Lys Gly Glu Ala Val Trp Phe Ala Pro Asp Gin Asp 

210 " 215 220 

Tyr Gly Arg Lys Gly Ser Ser Phe Ala Pro Phe Phe Ala Val Glu Asn 
225 230 235 240 

Val Ala Thr Thr Asn Gly Thr Tyr Val Leu Ser Arg Leu Ser Gly Ala 

245 250 255 

Ala Met Leu Thr Val Thr Met Val Arg Lys Ala Asp Tyr Ser Gly Tyr 

260 265 270- 

Arg Leu Phe He Thr Pro Glu Met Glu Gly Tyr Pro Thr Asp Glu Asn 

275 280 285 

Gin Ala Ala Ala Tyr Met Asn Lys He He Glu Lys Glu He Met Arg 

290 295 300 

Ala Pro Glu Gin Tyr Leu Trp He His Arg Arg Phe Lys Thr Arg Pro 
305 310 315 320 

Val Gly Glu Ser Ser Leu Tyr He 
325 



<210> 441 
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<211> 87 
<212> PRT 

<213> Escherichia coli 
<400> 441 
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<210> 442 
<211> 430 
<212> PRT 

<213> Escherichia coli 
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315 320 
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Asn Cys Ser Val Leu Thr Lys Ala 
345 350 
Arg Val Asn Phe Asp Glu Ala Ser 
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Leu Arg Gly Glu Glu Asn He Ser 
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Leu Gin Lys Phe Phe Glu Glu Asp 
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He Met Gin Asp Ala Asp 
425 430 



<210> 443 
<211> 883 
<212> PRT 

<213> Escherichia coli 
<400> 443 

Met Asn Glu Gin Tyr Ser Ala Leu Arg Ser Asn Val Ser Met Leu Gly 

15 10 15 

Lys Val Leu Gly Glu Thr He Lys Asp Ala Leu Gly Glu His He Leu 

20 25 30 

Glu Arg Val Glu Thr He Arg Lys Leu Ser Lys Ser Ser Arg Ala Gly 

35 40 45 

Asn Asp Ala Asn Arg Gin Glu Leu Leu Thr Thr Leu Gin Asn Leu Ser 

50 55 60 

Asn Asp Glu Leu Leu Pro Val Ala Arg Ala Phe Ser Gin Phe Leu Asn 
65 70 75 80 

Leu Ala Asn Thr Ala Glu Gin Tyr His Ser He Ser Pro Lys Gly Glu 

85 90 95 

Ala Ala Ser Asn Pro Glu Val He Ala Arg Thr Leu Arg Lys Leu Lys 

100 105 110 

Asn Gin Pro Glu Leu Ser Glu Asp Thr He Lys Lys Ala Val Glu Ser 

115 120 125 . 

Leu Ser Leu Glu Leu Val Leu Thr Ala His Pro Thr Glu He Thr Arg 

130 135 140 

Arg Thr Leu He His Lys Met Val Glu Val Asn Ala Cys Leu Lys Gin 
145 150 155 • 160 

Leu Asp Asn Lys Asp He Ala Asp Tyr Glu His Asn Gin Leu Met Arg 

165 170 175 

Arg Leu Arg Gin Leu He Ala Gin Ser Trp His Thr Asp Glu He Arg 

180 185 190 

Lys Leu Arg Pro Ser Pro Val Asp Glu Ala Lys Trp Gly Phe Ala Val 

195 200 205- 

Val Glu Asn Ser Leu Trp Gin Gly Val Pro Asn Tyr Leu Arg Glu Leu 

210 215 220 

Asn Glu Gin Leu Glu Glu Asn Leu Gly Tyr Lys Leu Pro Val Glu Phe 
225 230 - 235 240 

Val Pro Val Arg Phe Thr Ser Trp Met Gly Gly Asp Arg Asp Gly Asn 

245 250 255 

Pro Asn Val Thr Ala Asp He Thr Arg His Val Leu Leu Leu Ser Arg 

260 265 270 

Trp Lys Ala Thr Asp Leu Phe Leu Lys Asp He Gin Val Leu Val Ser 
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Val Val Asn lie Ala Ser Tyr Gin 

130 135 
lie Arg Glu Lys Ala Lys Lys Gin 
145 150 
Leu Ala Glu Gin Arg Glu Lys Pro 
165 

Lys Met Glu Gly Thr Phe Lys Arg 
180 

Ala Asp lie Asn Glu His Leu lie 
195 200 



Val Ser Pro Asn Asp Val Val Ser 
140 

Ser Arg Val Lys Ala Ala Leu Glu 
155 160 
Thr Trp Leu Glu Val Asp Ala Gly 

170 175 
Lys Pro Glu Arg Ser Asp Leu Ser 
185 190 
Val Glu Leu Tyr Ser Lys 
205 



<210> 4 62 
<211> 129 
<212> PRT 

<213> Escherichia coli 



<400> 462 
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<210> 463 
<211> 118 
<212> PRT 

<213> Escherichia coli 



<400> 463 
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<210> 464 
<211> 460 
<212> PRT 

<213> Escherichia coli 
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Val Glu Ala He Ser Arg Gin Leu He Lys His Arg Ala Ala Gin Gly 

435 440 445 

Glu Ala Val Pro Asp Ala Ala Thr Ala Ala Ser His 
450 455 460 



<210> 465 
<211> 536 
<212> PRT 

<213> Escherichia coli 
<400> 465 

Met Glu He Thr Asn Pro lie Leu Thr Gly Phe Asn Pro Asp Pro Ser 

1 5 10 15 

Leu Cys Arg Gin Gly Glu Asp Tyr Tyr He Ala Thr Ser Thr Phe Glu 

20 25 30 

Trp Phe Pro Gly Val Arg He Tyr His Ser Arg Asp Leu Lys Asn Trp 

35 40 45 

Ser Leu Val Ser Thr Pro Leu Asp Arg Val Ser Met Leu Asp Met Lys 

50 55 60 

Gly Asn Pro Asp Ser Gly Gly He Trp Ala Pro Cys Leu Ser Tyr Ala 
65 70 75 80 

Asp Gly Lys Phe Trp Leu Leu Tyr Thr Asp Val Lys He Val Asp Ser 

85 90 95 

Pro Trp Lys Asn Gly Arg Asn Phe Leu Val Thr Ala Pro Ser He Glu 

100 " " 105 110 

Gly Pro Trp Ser Glu Pro He Pro Met Gly Asn Gly Gly Phe Asp Pro 

115 120 125 

Ser Leu Phe His Asp Asp Asp Gly Arg Lys Tyr Tyr He Tyr Arg Pro 

130 135 140 

Trp Gly Pro Arg His His Ser Asn Pro His Asn Thr He Val Leu Gin 
145 ~ 150 155 160 

Ala Phe Asp Pro Gin Thr Gly Thr Leu Ser Pro Glu Arg Lys Thr Leu 

165 170 175 

Phe Thr Gly Thr Pro Leu Cys Tyr Thr Glu Gly Ala His Leu Tyr Arg 

180 185 190 

His Ala Gly Trp Tyr Tyr Leu Met Ala Ala Glu Gly Gly Thr Ser Tyr 

195 ~ " 200 205 

Glu His Ala Val Val Val Leu Arg Ser Lys Asn He Asp Gly Pro Tyr 

210 215 220 

Glu Leu His Pro Asp Val Thr Met Met Thr Ser Trp His Leu Pro Glu 
225 230 235 240 

Asn Pro Leu Gin Lys Ser Gly His Gly Ser Leu Leu Gin Thr His Thr 

245 250 255 

Gly Glu Trp Tyr Met Ala Tyr Leu Thr Ser Arg Pro Leu Arg Leu Pro 

260 265 270 

Gly Val Pro Leu Leu Ala Ser Gly Gly Arg Gly Tyr Cys Pro Leu Gly 

275 280 285 

Arg Glu Thr Gly He Ala Arg He Glu Trp Arg Asp Gly Trp Pro Tyr 

290 295 300 

Val Glu Gly Gly Lys His Ala Gin Leu Thr Val Lys Gly Pro Gin Val 
305 - - - 31Q 315 32Q 

Ala Glu Gin Pro Ala Ala Val Pro Gly Asn Trp Arg Asp Asp Phe Asp 

325 330 335 

Ala Ser Ser Leu Asp Pro Glu Leu Gin Thr Leu Arg He Pro Phe Asp 

340 345 350 

Asp Thr Leu Gly Ser Leu Thr Ala Arg Pro Gly Phe Leu Arg Leu Tyr 

355 360 365 

Gly Asn Asp Ser Leu Asn Ser Thr Phe Thr Gin Ser Thr Val Ala Arg 

370 375 380 

Arg Trp Gin His Phe Ala Phe Arg Ala Glu Thr Arg Met Glu Phe Ser 
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<210> 466 
<211> 325 
<212> PRT 

<213> Escherichia coli 
<400> 466 
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Asn Tyr Trp Gin His Asp He Pro Ala He Met He Thr Asp Thr Ala 

275 280 285 

Phe Tyr Arg Asn Lys Gin Tyr His Leu Pro Gly Asp Thr Ala Asp Arg 

290 295 ' 300 

Leu Asn Tyr Gin Lys Met Ala Gin Val Val Asp Gly Val He Thr Leu 
305 " 310 315 320 

Leu Tyr Asn Ser Lys 
325 



<210> 467 
<211> 320 
<212> PRT 

<213> Escherichia coli 
<400> 467 

Met Met He Lys Thr Arg Phe Ser Arg Trp Leu Thr Phe Phe Thr Phe 

1 5 10 15 

Ala Ala Ala Val Ala Leu Ala Leu Pro Ala Lys Ala Asn Thr Trp Pro 

20 25 30 

Leu Pro Pro Ala Gly Ser Arg Leu Val Gly Glu Asn Lys Phe His Val 

35 ** 40 45 

Val Glu Asn Asp Gly Gly Ser Leu Glu Ala He Ala Lys Lys Tyr Asn 

50 " 55 60 

Val Gly Phe Leu Ala Leu Leu Gin Ala Asn Pro Gly Val Asp Pro Tyr 
65 70 75 80 

Val Pro Arg Ala Gly Ser Val Leu Thr He Pro Leu Gin Thr Leu Leu 

85 90 95 

Pro Asp Ala Pro Arg Glu Gly He Val He Asn He Ala Glu Leu Arg 

100 105 110 

Leu Tyr Tyr Tyr Pro Pro Gly Lys Asn Ser Val Thr Val Tyr Pro He 

115 120 125 

Gly He Gly Gin Leu Gly Gly Asp Thr Leu Thr Pro Thr Met Val Thr 

130 135 140 

Thr Val Ser Asp Lys Arg Ala Asn Pro Thr Trp Thr Pro Thr Ala Asn 
145 * 150 155 160 

He Arg Ala Arg Tyr Lys Ala Gin Gly He Glu Leu Pro Ala Val Val 

165 170 175 

Pro Ala Gly Leu Asp Asn Pro Met Gly His His Ala He Arg Leu Ala 

180 .185 190 

Ala Tyr Gly Gly Val Tyr Leu Leu His Gly Thr Asn Ala Asp Phe Gly 

195 200 205 

He Gly Met Arg Val Ser Ser Gly Cys He Arg Leu Arg Asp Asp Asp 

210 215 220 

He Lys Thr Leu Phe Ser Gin Val Thr Pro Gly Thr Lys Val Asn He 
225 230 235 240 

He Asn Thr Pro He Lys Val Ser Ala Glu Pro Asn Gly Ala Arg Leu 

245 250 255 

Val Glu Val His Gin Pro Leu Ser Glu Lys He Asp Asp Asp Pro Gin 

260 265 270 

Leu Leu Pro He Thr Leu Asn Ser Ala Met Gin Ser Phe Lys Asp Ala 

275 280 285 

Ala Gin Thr Asp Ala Glu Val Met Gin His Val Met Asp Val Arg Ser 

290 295 300 

Gly Met Pro Val Asp Val Arg Arg His Gin Val Ser Pro Gin Thr Leu 
305 310 315 320 



<210> 468 
<211> 494 
<212> PRT 
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<213> Escherichia coli 
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<213> Artificial Sequence 
<220> 

<223> Linker containing T7 terminator 
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AMENDED CLAIMS 

[received by the International Bureau on 14 January 2002 (14.01.02); 
original claims 1-131 replaced by amended claims 1-131 (14pages)] 

1. A purified or isolated nucleic acid sequence consisting essentially of one the 
sequence of nucleotides of SEQ ED NOs: 1-93, wherein expression of said nucleic acid in a 
microorganism is capable of inhibiting proliferation of a microorganism. 

2. The nucleic acid sequence of Claim 1, wherein said nucleic acid sequence is 
complementary to at least a portion of the nucleotide sequence of the coding strand of a gene whose 
expression is required for proliferation of a microorganism. 

3. The nucleic acid of Claim 1, wherein said nucleic acid sequence has a nucleotide 
seuqence complementary to at least a portion of the nucleotide sequence of an RNA required for 
proliferation of a microorganism. 

4. The nucleic acid of Claim 3, wherein the nucleotide sequence of said RNA encodes 
more than one gene product 

5. A purified or isolated nucleic acid comprising a fragment of one of the nucleotide 
seuqence of SEQ ID NOs.: 1-93, said fragment selected from the group consisting of fragments 
comprising at least 10, at least 20, at least 25, at least 30, at least 50 and more than 50 consecutive 
nucleotides of one of the nucleotide sequences of SEQ ID NOs: 1-93. 

6. A vector comprising a promoter operably linked to the nucleic acid sequence of 
Claims 1,2,3,4, or 5. 

7. The vector of Claim 6, wherein said promoter is active in a microorganism selected 
from the group consisting of Aspergillus fiimigatus, Bacillus anthracis, Burkholderia cepacia, 
Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida 
tropicalis, Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called 
Candida pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatus, 
Clostridium botulinum, Clostridium difficile, Cryptococcus neoformans, Enterobacter cloacae, 
Enterococcus faecalis, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Klebsiella 
pneumoniae, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria 
gonorrhoeae, Pseudomonas aeruginosa, Salmonella cholerasuis, Salmonella enterica, Salmonella 
paratyphi, Salmonella typhi, Salmonella typhimurium, Staphylococcus aureus, Klebsiella 
pneumoniae, Listeria monocytogenes, Moxarella catarrhalis, Shigella boydii, Shigella dysenteriae, 
Shigella flexneri, Shigella sonnei, Pseudomonas aeruginosa, Staphylococcus epidermidis, 
Streptococcus pneumoniae, Treponema pallidum, Yersinia pestis and any species falling within the 
genera of any of die above species. 

8. A host cell containing die vector of Claim 6. 

9. A purified or isolated nucleic acid consisting essentially of the coding sequence of 

one of SEQ ID NOs: 105-112, 119-122, 134-160, 164-171, 179-265, 271-273, 275, and 279-286. 
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10. A fragment of the nucleic acid of Claim 8, said fragment comprising at least 10, at 
least 20, at least 25, at least 30, at least 50 or more than 50 consecutive nucleotides of one of SEQ ID 
NOs: 105-112.. 119-122, 134-160, 164-171, 179-265,271-273,275, and 279-286. 

11. A vector comprising a promoter operably linked to the nucleic acid of Claim 9 or 
Claim 10. 

12. A purified or isolated antisense nucleic acid comprising a nucleic acid sequence 
complementary to at least a portion of an intragenic sequence, intergenic sequence, sequences 
spanning at least a portion of two or more genes, 5' noncoding region, or 3' noncoding region within 
an operon comprising a proliferation-required gene whose activity or expression is inhibited by an 
antisense nucleic acid comprising one of SEQ ID NOs.: 1-93. 

13. A purified or isolated nucleic acid comprising a nucleic acid having at least 70% 
identity to a sequence selected from the group consisting of SEQ ID NOs.: 1-93, fragments 
comprising at least 25 consecutive nucleotides of SEQ ID NOs.: 1-93, the sequences complementary 
to SEQ ID NOs.: 1-93 and the sequences complementary to fragments comprising at least- 25 
consecutive nucleotides of SEQ ID NOs.: 1-93 as determined using BLASTN version 2.0 with the 
default parameters. 

14. The nucleic acid of Claim 13, wherein said nucleic acid is from an organism selected 
from the group consisting of Aspergillus fumigatus, Bacillus anthracis, Burkholderia cepacia, 
Campylobacter jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida 
tropicalis, Candida parapsilosis, Candida guilliermondii ? Candida krusei, Candida kefyr (also called 
Candida pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatus, 
Clostridium botulinum, Clostridium difficile, Cryptococcus neoformans, Enterobacter cloacae, 
Enterococcus faecalis, Escherichia coli. Haemophilus influenzae, Helicobacter pylori, Klebsiella 
pneumoniae, Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria 
gonorrhoeae, Pseudomonas aeruginosa, Salmonella cholerasuis, Salmonella enterica, Salmonella 
paratyphi, Salmonella typhi, Salmonella typhimurium, Staphylococcus aureus, Klebsiella 
pneumoniae, Listeria monocytogenes, Moxarella catarrhalis, Shigella boydii, Shigella dysenteriae, 
Shigella flexneri, Shigella sonnei, Pseudomonas aeruginosa, Staphylococcus epidermidis, 
Streptococcus pneumoniae, Treponema pallidum, Yersinia pestis and any species falling within the 
genera of any of the above species. 

15. A vector comprising a promoter operably linked to a nucleic acid encoding a 
polypeptide whose expression is inhibited by an antisense nucleic acid comprising one of SEQ ID 
NOs.: 1-93. 

16. A host cell containing the vector of Claim 15. 

17. The vector of Claim 15, wherein said polypeptide comprises a polypeptide 
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comprising a sequence selected from the group consisting of SEQ ID NOs: 298-305, 312-315, 327- 
353, 357-364, 372-458, 464-466, 468 and 472-479. 

18. A purified or isolated polypeptide comprising a polypeptide whose expression is 
inhibited by an antisense nucleic acid comprising one of SEQ ID NOs.: 1-93, or a fragment selected 
from the group consisting of fragments comprising at least 5, at least 10, at least 20, at least 30, at 
least 40, at least 50, at least 60 or more than 60 consecutive amino acids of one of the said 
polypeptides. 

19. The polypeptide of Claim 18, wherein said polypeptide comprises a polypeptide 
comprising one of SEQ ID NOs.: 298-305, 312-315, 327-353, 357-364, 372-458, 464-466, 468 and 
472-479 or a fragment comprising at least 5. at least 10, at least 20. at least 30, at least 40, at least 50, 
at least 60 or more than 60 consecutive amino acids of a polypeptide comprising a sequence selected 
from the group consisting of SEQ ID NOs.: 298-305, 312-315. 327-353, 357-364, 372-458, 464-466, 
468 and 472-479. 

20. A purified or isolated polypeptide comprising a polypeptide having at least 25% 
identity to a polypeptide whose expression is inhibited by a sequence selected from the group 
consisting of SEQ ID NOs.: 1 -93. or at least 25% identity to a fragment comprising at least 5, at least 
10, at least 20, at least 30. at least 40, at least 50 ? at least 60 or more than 60 consecutive amino acids 
of a polypeptide whose expression is inhibited by a nucleic acid selected from the group consisting of 
SEQ ID NOs.: 1-93 as determined using FASTA version 3.0t7S with the default parameters. 

21. The polypeptide of Claim 20, wherein said polypeptide has at least 25% identity to a 
polypeptide comprising one of SEQ ID NOs: 298-305. 312-315. 327-353, 357-364, 372-458, 464- 
466, 468 and 472-479 or at least 25% identity to a fragment comprising at least 5, at least 10, at least 
20, at least 30, at least 40. at least 50, at least 60 or more than 60 consecutive amino acids of a 
polypeptide comprising one of SEQ ID NOs.: 298-305, 312-315, 327-353, 357-364, 372-458, 464- 
466, 468 and 472-479 as determined using FASTA version 3.0t78 with the default parameters. 

22. An antibody capable of specifically binding the polypeptide of one of Claims 18-21. 

23. A method of producing a polypeptide, comprising introducing a vector comprising a 
promoter operably linked to a nucleic acid encoding a polypeptide whose expression is inhibited by 
an antisense nucleic acid comprising one of SEQ ID NOs.: 1-93 into a cell and expressing said 
polypeptide. 

24. The method of Claim 23, further comprising the step of isolating said polypeptide. 

25. The method of Claim 23, wherein said polypeptide comprises a sequence selected 
from the group consisting of SEQ ID NOs.: 298-305, 312-315, 327-353, 357-364, 372-458, 464-466, 
468 and 472-479. 

26. A method of inhibiting proliferation of a microorganism comprising inhibiting the 
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activity or reducing the amount of a gene product whose expression is inhibited by an antisense 
nucleic acid comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93 or 
inhibiting the activity or reducing the amount of a nucleic acid encoding said gene product. 

27. The method of Claim 26, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 298-305, 312-315, 327- 
353, 357-364, 372-458, 464^66, 468 and 472-479. 

28. A method for identifying a compound which influences the activity of a gene product 
required for proliferation, said gene product comprising a gene product whose expression is inhibited 
by an antisense nucleic acid comprising a sequence selected from the group consisting of SEQ ID 
NOs.: 1-93, said method comprising: 

contacting said gene product with a candidate compound; and 

determining whether said compound influences the activity of said gene product. 

29. The method of Claim 2S. wherein said gene product is a polypeptide and said activity, 
is an enzymatic activity. 

30. The method of Claim 2S. wherein said gene product is a polypeptide and said activity 
is a carbon compound catabolism activity. 

3 1 . The method of Claim 2S. wherein said gene product is a polypeptide and said activity 
is a biosynthetic activity. 

32. The method of Claim 28. wherein said gene product is a polypeptide and said activity 
is a transporter activity. 

33. The method of Claim 28. wherein said gene product is a polypeptide and said activity 
is a transcriptional activity. 

34. The method of Claim 28. wherein said gene product is a polypeptide and said activity 
is a DNA replication activity. 

35. The method of Claim 28, wherein said gene product is a polypeptide and said activity 
is a cell division activity. 

36. A compound identified using the method of Claim 28. 

37. The method of Claim 28, wherein said gene product is a polypeptide comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 298-305, 312-315, 327-353, 357-364, 
372-458, 464-466, 468 and 472-479. 

38. A method for identifying a compound or nucleic acid having the ability to reduce the 
activity or level of a gene product required for proliferation, said gene product comprising a gene 
product whose activity or expression is inhibited by an antisense nucleic acid comprising a sequence 
selected from the group consisting of SEQ ID NOs.: 1-93, said method comprising: 
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(a) providing a target that is a gene or RNA, wherein said target comprises a 
nucleic acid encoding said gene product; 

(b) contacting said target with a candidate compound or nucleic acid; and 

(c) measuring an activity of said target. 

39. The method of Claim 38, wherein said target is a messenger RNA molecule and said 
activity is translation of said messenger RNA. 

40. The method of Claim 38, wherein said target is a messenger RNA molecule and said 
activity is transcription of a gene encoding said messenger RNA. 

41. The method of Claim 38, wherein said target is a gene and said activity is 
transcription of said gene. 

42. The method of Claim 38, wherein said target is a nontranslated RNA and said 
activity is processing or folding of said nontranslated RNA or assembly of said nontranslated RNA 
into a protein/RNA complex. 

43. The method of Claim 3S. wherein said target gene or RNA encodes a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 298-305, 312-315, 327- 
353, 357-364, 372-458, 464-466. 468 and 472-479. 

44. A compound or nucleic acid identified using the method of Claim 38. 

45. A method for identifying a compound which reduces the activity or level of a gene 
product required for proliferation of a microorganism, wherein the activity or expression of said gene 
product is inhibited by an antisense nucleic acid comprising a sequence selected from the group 
consisting of SEQ ID NOs.: 1-93. said method comprising the steps of: 

(a) expressing a sub-lethal level of an antisense nucleic acid complementary to a 
nucleic acid encoding said gene product in a cell to reduce the activity or amount of said 
gene product in said cell, thereby producing a sensitized cell; 

(b) contacting said sensitized cell with a compound; and 

(c) determining whether said compound inhibits the growth of said sensitized 

cell. 

46. The method of Claim 45, wherein said determining step comprises determining 
whether said compound inhibits the growth of said sensitized cell to a greater extent than said 
compound inhibits the growth of a nonsensitized cell. 

47. The method of Claim 45, wherein said cell is selected from the group consisting of 
bacterial cells, fungal cells, plant cells, and animal cells. 

48. The method of Claim 45, wherein said cell is a Gram negative bacterium. 

49. The method of Claim 45, wherein said cell is an £. coli cell. 

50. The method of Claim 45, wherein said cell is from an organism selected from the 
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group consisting of Aspergillus fumigatus, Bacillus anthracis, Burkholderia cepacia, Campylobacter 
jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis, 
Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida 
pseudotropicalis), Candida dubliniensis. Chlamydia pneumoniae, Chlamydia trachomatus, 
Clostridium botulinum, Clostridium difficile, Cryptococcus neoformans, Enterobacter cloacae, 
Enterococcus faecalis, Escherichia colt, Haemophilus influenzae, Helicobacter pylori, Klebsiella 
pneumoniae. Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria 
gonorrhoeae, Pseudomonas aeruginosa, Salmonella cholerasuis, Salmonella enterica, Salmonella 
paratyphi, Salmonella typhi, Salmonella typhimurium, Staphylococcus aureus, Klebsiella 
pneumoniae, Listeria monocytogenes, Moxarella catarrhalis. Shigella boydii, Shigella dysenteriae, 
Shigella flexneri, Shigella sonnei, Pseudomonas aeruginosa, Staphylococcus epidermidis, 
Streptococcus pneumoniae, Treponema pallidum, Yersinia pestis and any species falling within the 
genera of any of the above species. 

51. • . The method of Claim 45, wherein said antisense nucleic acid is transcribed from an 
inducible promoter. 

52. The method of Claim 51, further comprising the step of contacting said cell with a 
concentration of inducer which induces said antisense nucleic acid to a sub-lethal level. 

53. The method of Claim 45, wherein growth inhibition is measured by monitoring 
optical density of a culture growth solution. 

• 54. The method of Claim 45, wherein said gene product is a polypeptide. 

55. The method of Claim 54, wherein said polypeptide comprises a sequence selected 
fromthe group consisting of SEQ ID NOs.: 298-305, 3 12-315, 327-353, 357-364, 372-458, 464-466, 
468 and 472-479. 

56. The method of Claim 45, wherein said gene product is an RNA. 

57. A compound identified using the method of Claim 45. 

58. A method for inhibiting cellular proliferation comprising introducing a compound 
with activity against a gene whose activity or expression is inhibited by an antisense nucleic acid 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93 or a compound with 
activity against the product of said gene into a population of cells expressing said gene. 

59. The method of Claim 58, wherein said compound is an antisense nucleic acid 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93, or a proliferation- 
inhibiting portion thereof. 

60. The method of Claim 59, wherein said proliferation inhibiting portion of one of SEQ 
ID NOs.: 1-93 is a fragment comprising at least 10, at least 20, at least 25, at least 30, at least 50 or 
more than 5 1 consecutive nucleotides of one of SEQ ID NOs.: 1-93. 
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61. The method of Claim 58, wherein said population is a population selected from the 
group consisting of bacterial cells, fungal cells, plant cells, and animal cells. 

62. The method of Claim 58, wherein said population is a population of Gram negative 
bacteria. 

63. The method of Claim 58. wherein said population is a population of E. coli cells. 

64. The method of Claim 58. wherein said population is a population selected from the 
group consisting of Aspergillus fumigatus. Bacillus anthracis, Burkholderia cepacia, Campylobacter 
jejuni, Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis, 
Candida parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida 
pseudotropicalis), Candida dubliniensis. Chlamydia pneumoniae, Chlamydia trachomatus, 
Clostridium botulinum, Clostridium difficile, Cryptococcus neoformans, Enterobacter cloacae, 
Enterococcus faecalis, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Klebsiella 
pneumoniae,Listeria monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria 
gonorrhoeae, Pseudomonas aeruginosa. Salmonella cholerasuis, Salmonella enterica, Salmonella 
paratyphi, Salmonella typhi. Salmonella typhimunum. Staphylococcus aureus, Klebsiella 
pneumoniae, Listeria monocytogenes. Moxarella catarrhal is. Shigella boydii, Shigella dysenteriae, 
Shigella flexneri, Shigella sonnei. Pseudomonas aeruginosa. Staphylococcus epidermidis, 
Streptococcus pneumoniae, Treponema pallidum, Yersinia pestis and any species falling within the 
genera of any of the above species. 

65. The method of Claim 58, wherein said gene encodes a polypeptide comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 298-305. 312-315, 327-353, 357-364, 
372-458, 464-466, 468 and 472-479. 

66. A preparation comprising an effective concentration of an antisense nucleic acid 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93, or a proliferation- 
inhibiting portion thereof in a pharmaceutically acceptable carrier. 

67. The preparation of Claim 66, wherein said proliferation-inhibiting portion of one of 
SEQ ID NOs.: 1-93 comprises at least 10. at least 20, at least 25, at least 30, at least 50 or more than 
50 consecutive nucleotides of one of SEQ ID NOs.: 1 -93. 

68. A method for inhibiting the activity or expression of a gene in an operon required for 
proliferation wherein the activity or expression of at least one gene in said operon is inhibited by an 
antisense nucleic acid comprising a sequence selected from the group consisting of SEQ ID NOs.: 1- 
93, said method comprising contacting a cell in a cell population with an antisense nucleic acid 
comprising at least a proliferation-inhibiting portion of said operon. 

69. The method of Claim 68, wherein said antisense nucleic acid comprises a sequence 
selected from the group consisting of SEQ ID NOs.: 1-93 or a proliferation inhibiting portion thereof. 
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70. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
acid by introducing a plasmid which expresses said antisense nucleic acid into said cell population. 

71. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
acid by introducing a phage which expresses said antisense nucleic acid into said cell population. 

72. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
acid by expressing said antisense nucleic acid from the chromosome of cells in said cell population. 

73. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
acid by introducing a promoter adjacent to a chromosomal copy of said antisense nucleic acid such 
that said promoter directs the synthesis of said antisense nucleic acid. 

74. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
acid by introducing a retron which expresses said antisense nucleic acid into said cell population. 

75. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
acid by introducing a ribozyme into said cell-population, wherein a binding portion of said ribozyme 
is complementary to said antisense oligonucleotide. 

76. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
acid by introducing a liposome comprising said antisense oligonucleotide into said cell. 

77. The method of Claim 6S, wherein said cell is contacted with said antisense nucleic 
acid by electroporation of said antisense nucleic acid. 

78. The method of Claim 68, wherein said antisense nucleic acid is a fragment 
comprising at least 10, at least 20, at least 25, at least 30. at least 50 or more than 50 consecutive 
nucleotides of one of SEQ ID NOs.; 1-93. 

79. The method of Claim 68 wherein said antisense nucleic acid is an oligonucleotide. 

80. A method for identifying a gene which is required for proliferation of a 
microorganism comprising: 

(a) contacting a microorganism other than E. coli with a nucleic acid selected 
from the group consisting of SEQ ID NOs.; 1-93; 

(b) determining whether said nucleic acid inhibits proliferation of said 
microorganism; and 

(c) identifying the gene in said microorganism which is inhibited by said nucleic 

acid. 

81. The method of Claim 80, wherein said microorganism is a Gram negative bacterium. 

82. The method of Claim 80 wherein said microorganism is selected from the group 
consisting of Aspergillus fumigatus. Bacillus anthracis, Burkholderia cepacia, Campylobacter jejuni, 
Candida albicans, Candida glabrata (also called Torulopsis glabrata), Candida tropicalis, Candida 
parapsilosis, Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida 
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pseudotropicalis), Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatis, 
Clostridium botulinum, Clostridium difficile, Cryptococcus neoformans, Enterobacter cloacae, 
Enterococcus faecalis, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Klebsiella 
pneumoniae, Listeria monocytogenes. Mycobacterium leprae, Mycobacterium tuberculosis, 
Neisseria gonorrhoeae, Pseudomonas aeruginosa, Salmonella cholerasuis, Salmonella enterica, 
Salmonella paratyphi. Salmonella typhi, Salmonella typhimurium, Staphylococcus aureus, Klebsiella 
pneumoniae, Listeria monocytogenes, Moxarella catarrhalis, Shigella boydii, Shigella dysenteriae, 
Shigella flexneri, Shigella sonnei. Pseudomonas aeruginosa, Staphylococcus epidermidis, 
Streptococcus pneumoniae, Treponema pallidum, Yersinia pestis and any species falling within the 
genera of any of the above species. 

83. The method of Claim 80, further comprising introducing said nucleic acid into a 
vector functional in said microorganism prior to introducing said inhibitory nucleic acid into said 
microorganism. 

84. A method for identifying a compound having the ability to inhibit proliferation of a 
microorganism comprising: 

(a) identifying in a first microorganism a homolog of a gene or gene product 
present in a second microorganism which is different than said first microorganism, wherein 
the activity or level of said gene or gene product is inhibited by a nucleic acid comprising a 
sequence selected from the group consisting of SEQ ID NOs. 1-93 ; 

(b) identifying an inhibitory nucleic acid sequence which inhibits the activity of 
said homolog in said first microorganism; 

(c) contacting said first microorganism with a sub-lethal level of said inhibitory 
nucleic acid, thus sensitizing said first microorganism: 

(d) contacting the sensitized microorganism of step (c) with a compound; and 

(e) determining whether said compound inhibits proliferation of said sensitized 
microorganism. 

85. The method of Claim 84, wherein said determining step comprises determining 
whether said compound inhibits proliferation of said sensitized microorganism to a greater extent 
than said compound inhibits proliferation of a nonsensitized microorganism. 

86. The method of Claim 84 wherein step (a) comprises identifying a homologous 
nucleic acid to a gene or gene product whose activity or level is inhibited by a nucleic acid selected 
from the group consisting of SEQ ID NOs. 1-93 or a nucleic acid encoding a homologous 
polypeptide to a polypeptide whose activity or level is inhibited by a nucleic acid selected from the 
group consisting of SEQ ID NOs. 1-93 by using an algorithm selected from the group consisting of 
BLASTN version 2.0 with the default parameters and FASTA version 3.0t78 algorithm with the 
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default parameters to identify said homologous nucleic acid or said nucleic acid encoding a 
homologous polypeptide in a database. 

87. The method of Claim 84 wherein said step (a) comprises identifying a homologous 
nucleic acid or a nucleic acid encoding a homologous polypeptide by identifying nucleic acids which 
hybridize to said first gene. 

88. The method of Claim 84 wherein the step (a) comprises expressing a nucleic acid 
selected from the group consisting of SEQ ID NOs. 1-93 in said microorganism. 

89. The method of Claim 84, wherein said inhibitory nucleic acid is an antisense nucleic 

acid. 

90. The method of Claim 84, wherein said inhibitory nucleic acid comprises an antisense 
nucleic acid to a portion of said homolog. 

91 . The method of Claim 84, wherein said inhibitory nucleic acid comprises an antisense 
nucleic acid to a portion of the operon encoding said homolog. 

92. The method of Claim 84, wherein the step of contacting the first microorganism with 
a sub-lethal level of said inhibitory nucleic acid comprises directly contacting said microorganism 
with said inhibitory nucleic acid. 

93. The method of Claim 84 ? wherein the step of contacting the first microorganism with 
a sub-lethal level of said inhibitory nucleic acid comprises expressing an antisense nucleic acid to 
said homolog in said microorganism. 

94. The method of Claim 84, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 298-305, 312-315, 327- 
353, 357-364, 372^58, 464-466, 468 and 472-479. 

95. A compound identified using the method of Claim 84. 

96. A method of identifying a compound having the ability to inhibit proliferation 
comprising: 

(a) contacting a microorganism other than £. coli with a sub-lethal level of a 
nucleic acid comprising a sequence selected from the group consisting of SEQ ID NOs. 1-93 
or a portion thereof which inhibits the proliferation of £. coli, thus sensitizing said 
microorganism; 

(b) contacting the sensitized microorganism of step (a) with a compound; and 

(c) determining whether said compound inhibits proliferation of said sensitized 
microorganism. 

97. The method of Claim 96, wherein said determining step comprises determining 
whether said compound inhibits proliferation of said sensitized microorganism to a greater extent 
than said compound inhibits proliferation of a nonsensitized microorganism. 
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98. A compound identified using the method of Claim 96. 

99. A method for identifying a compound having activity against a biological pathway 
required for proliferation comprising: 

(a) sensitizing a cell by expressing a sub-lethal level of an antisense nucleic acid 
complementary to a nucleic acid encoding a gene product required for proliferation, wherein 
the activity or expression of said gene product is inhibited by an antisense nucleic acid 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93, in said cell 
to reduce the activity or amount of said gene product; 

(b) contacting the sensitized cell with a compound; and 

(c) determining whether said compound inhibits the growth of said sensitized 

cell. 

100. The method of Claim 99. wherein said determining step comprises determining 
whether said compound inhibits the growth of said sensitized cell to a greater extent than said 
compound inhibits the growth of a nonsensitized cell. 

101. The method of Claim 99. wherein said cell is selected from the group consisting of 
bacterial cells, fungal cells, plant cells, and animal cells. 

102. The method of Claim 99, wherein said cell is a Gram negative bacterium. 

103. The method of Claim 99, wherein said Gram negative bacterium is E. coli. 

104. The method of Claim 99. wherein said cell is selected from the group consisting of 
Aspergillus fumigatus. Bacillus anthracis. Burkholdena cepacia. Campylobacter jejuni, Candida 
albicans, Candida glabrata (also called Torulopsis glabrata). Candida tropicalis, Candida parapsilosis, 
Candida guilliermondii, Candida krusei, Candida kefyr (also called Candida pseudotropicalis), 
Candida dubliniensis, Chlamydia pneumoniae, Chlamydia trachomatus, Clostridium botulinum, 
Clostridium difficile, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus faecalis, 
Escherichia coli, Haemophilus influenzae. Helicobacter pylori, Klebsiella pneumoniae, Listeria 
monocytogenes, Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria gonorrhoeae, 
Pseudomonas aeruginosa, Salmonella cholerasuis, Salmonella enterica, Salmonella paratyphi, 
Salmonella typhi, Salmonella ryphimurium, Staphylococcus aureus, Klebsiella pneumoniae, Listeria 
monocytogenes, Moxarella catarrhalis. Shigella boydii. Shigella dysenteriae, Shigella flexneri, 
Shigella sonnei, Pseudomonas aeruginosa, Staphylococcus epidermidis, Streptococcus pneumoniae, 
Treponema pallidum. Yersinia pestis and any species falling within the genera of any of the above 
species. 

105. The method of Claim 99, wherein said antisense nucleic acid is transcribed from an 
inducible promoter. 

106. The method of Claim 99, further comprising contacting the cell with an agent which 
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induces expression of said antisense nucleic acid from said inducible promoter, wherein said 
antisense nucleic acid is expressed at a sub-lethal level. 

107. The method of Claim 99, wherein inhibition of proliferation is measured by 
monitoring the optical density of a liquid culture. 

108. The method of Claim 99, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 298-305, 312-315, 327- 
353, 357-364, 372-458, 464-466, 468 and 472-479 

109. A compound identified using the method of Claim 99. 

110. A method for identifying a compound having the ability to inhibit cellular 
proliferation comprising: 

(a) contacting a cell with an agent which reduces the activity or level of a gene 
product required for proliferation of said cell, wherein said gene product is a gene product 

• whose activity or expression is tnhibited by an antisense nucleic acid comprising a sequence 
selected from the group consisting of SEQ ID NOs.: 1-93; 

(b) . contacting said cell with a compound; and 

(c) determining whether said compound reduces proliferation of said contacted 

cell. 

111. The method of Claim 110. wherein said determining step comprises determining 
whether said compound reduces proliferation of said contacted cell to a greater extent than said 
compound reduces proliferation of cells which have not been contacted with said agent. 

1 12. The method of Claim 1 10, wherein said agent which reduces the activity or level of a 
gene 'product required for proliferation of said cell comprises an antisense nucleic acid to a gene or 
operon required for proliferation. 

113. The method of Claim 1 10, wherein said agent which reduces the activity or level of a 
gene product required for proliferation of said cell comprises a compound known to inhibit growth or 
proliferation of a microorganism. 

114. The method of Claim 1 10, wherein said cell contains a mutation which reduces the 
activity or level of said gene product required for proliferation of said cell. 

115. The method of Claim 1 14, wherein said mutation is a temperature sensitive mutation. 

116. The method of Claim 110, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 298-305, 312-315, 327- 
353, 357-364, 372-458, 464-466, 468 and 472-479 

117. A compound identified using the method of Claim 110. 

118. A method for identifying the biological pathway in which a proliferation-required 
gene or its gene product lies, wherein said gene or gene product comprises a gene or gene product 
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whose activity or expression is inhibited by an antisense nucleic acid comprising a sequence selected 
from the group consisting of SEQ ID NOs.: 1-93, said method comprising: 

(a) expressing a sub-lethal level of an antisense nucleic acid which inhibits the 
activity of said proliferation-required gene or gene product in a cell; 

(b) contacting said cell with a compound known to inhibit growth or 
proliferation of a microorganism, wherein the biological pathway on which said compound 
acts is known; and 

(c) determining whether said cell is sensitive to said compound. 

119. The method of Claim 118, wherein said determining step comprises determining 
whether said cell has a substantially greater sensitivity to said compound than a cell which does not 
express said sub-lethal level of said antisense nucleic acid and wherein said gene or gene product lies 
in the same pathway on which said compound acts if said cell expressing said sub-lethal level of said 
antisense nucleic acid has a substantially greater sensitivity to said compound than said cell which 
does not express said sub-lethal level of said antisense nucleic acid. 

120. The method of Claim 118, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 298-305, 312-315, 327- 
353, 357-364, 372-458, 464-466, 468 and 472-479 

121. A method for determining the biological pathway on which a test compound acts 
comprising: 

(a) expressing a sub-lethal level of an antisense nucleic acid complementary to 
a proliferation-required nucleic acid in a cell, wherein the activity or expression of said 
proliferation-required nucleic acid is inhibited by an antisense nucleic acid comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 1-93 and wherein the 
biological pathway in which said proliferation-required nucleic acid or a protein encoded by 
said proliferation-required polypeptide lies is known, 

(b) contacting said cell with said test compound; and 

(c) determining whether said cell is sensitive to said test compound. 

122. The method of Claim 121, wherein said determining step comprises determining 
whether said cell has a substantially greater sensitivity to said test compound than a cell which does 
not express said sub-lethal level of said antisense nucleic acid. 

123. The method of Claim 121, further comprising: 

(d) expressing a sub-lethal level of a second antisense nucleic acid 
complementary to a second proliferation-required nucleic acid in a second cell, wherein said 
second proliferation-required nucleic acid is in a different biological pathway than said 
proliferation-required nucleic acid in step (a); and 
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(e) determining whether said second cell does not have a substantially greater 
sensitivity to said test compound than a cell which does not express said sub-lethal level of 
said second antisense nucleic acid, wherein said test compound is specific for the biological 
pathway against which the antisense nucleic acid of step (a) acts if said second cell does not 
have substantially greater sensitivity to said test compound. 

124. A purified or isolated nucleic acid comprising a sequence selected from the group 
consisting of SEQ ID NOs.: 1-93. 

125. A compound which interacts with a gene or gene product whose activity or 
expression is inhibited by an antisense nucleic acid comprising one of SEQ ID NOs.: 1-93 to inhibit 
proliferation. 

126. A compound which interacts with a polypeptide whose expression is inhibited by an 
antisense nucleic acid comprising one of SEQ ID NOs.: 1-93 to inhibit proliferation. 

127. A method for manufacturing an antibiotic comprising the steps of: 

screening one or more candidate compounds to identify a compound that reduces 
the activity or level of a gene product required for proliferation, said gene product 
comprising a gene product whose activity or expression is inhibited by an antisense nucleic 
acid comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-93; and 

manufacturing the compound so identified. 

128. The method of Claim 127, wherein said screening step comprises performing any 
one of the methods of Claims 28, 38, 45, 96, 99 and 1 10. 

129. A method for inhibiting proliferation of a microorganism in a subject comprising 
administering a compound that reduces the activity or level of a gene product required for 
proliferation of said microorganism, said gene product comprising a gene product whose activity or 
expression is inhibited by an antisense nucleic acid comprising a sequence selected from the group 
consisting of SEQ ID NOs.: 1-93 to said subject. 

130. The method of Claim 129 wherein said subject is selected from the group consisting 
of vertebrates, mammals, avians, and human beings. 

131. The method of Claim 129, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 298-305, 312-315, 327- 
353, 357-364, 372-458, 464-466, 468 and 472-479. 
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